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METHODS FOR ANALYZING ANIMAL PRODUCTS 

The present invention relates to methods for analysing animals and their products. In 
5 particular, the invention relates to methods for differentiating animal products on the 
basis of breed origin, determining or testing the breed origin of an animal product and 
for validating an animal product, as well as to kits for carrying out such methods. In 
addition, the present invention provides methods for the determination of pig genotype 
with respect to coat colour. 

10 

Introduction 

Animal breeds 

For thousands of years, selective pressure has been applied by humans in the course of 
15 animal husbandry to produce livestock exhibiting certain desirable characteristics. 
These characteristics have been selected to meet aesthetic, technical, ritual, social and 
economic needs. The result has been the production of a large number of different 
animal breeds. 

20 The term "breed" is a term of art used to define a homogenous, subspecific group of 
domestic livestock with definable and identifiable external characteristics that enable it 
to be separated by visual appraisal from other similarly defined groups within the same 
species. The term therefore defines a group of animals to which selective pressure has 
been applied by humans to give rise to a uniform appearance that is inheritable and 

25 distinctive with respect to other members of the species. 

As breeds become established, their integrity is maintained by breed societies, 
herdbooks and pedigree records. 

3 0 Breed selection 

Conventional breed selection methods are based on direct measurement of the 
phenotype of an animal and/or its relatives. Thus, the implementation of breeding 
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schemes requires extensive phenotypic record keeping. For example, dairy herd 
improvement programs in the United States and Western Europe relied in part on the 
collection of individual records (milk yield and composition, type traits, health traits, 
etc.) performed on a monthly basis for millions of cows. Likewise, breeding companies 
5 carefully monitor their pig and poultry breeding stock for a whole range of phenotypic 
measurements. 

However, some important characteristics are not immediately apparent at the level of 
the living animal. For example, many parameters of meat quality are determined by 
10 subtle physiological or biochemical characteristics which are not readily apparent and 
so cannot serve as the basis for efficient artificial selection. 

Breeding for qualities of this type has relied in part upon selection for other (more 
readily apparent traits) which are to some extent coinherited (linked or associated) with 
15 the desirable characteristics. For example, in the pig industry lop ears have in the past 
been associated with mothering ability and so have been used as a marker for this trait 

Conventional breed selection methods are limited by the fact that some phenotypes are 
expressed only in one sex or at a specific developmental stage. Moreover, some 

20 phenotypes are difficult and costly to measure. Indirect detection of such phenotypic 
traits via DNA-based diagnosis (for use in marker-assisted selection or MAS) is 
therefore seen as a desirable alternative to direct measurement of phenotypic parameters 
(see Georges and Andersson (1996), Livestock genomics comes of age, Genome 
Research, Vol. 6: 907-921). However, the gene structure-function relationships 

25 underlying many of the desirable traits are often highly complex and not yet sufficiently 
well-established to make such an approach feasible in practice. 

Breed identification 

The definition of animal breeds is currently at a watershed. Whereas previously they 
3 0 have been defined by overt physical characteristics and pedigree records, in the future 
as new breeds are developed from specific breeding lines they will be defined by sets of 
DNA markers. The work described herein allows not only the most accurate approach 
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to breed determination currently possible in a range of products but also allows the 
integration in a common format of breed determinant information obtained through use 
of the present invention with that which will be used in the future. Hie present 
invention therefore allows not only the determination of source breed in the current 
5 environment but also links this to the development of future breeds and their unique 
identification. 

It is generally recognized that the only definitive way to identify a particular animal as a 
representative of a given breed is through its pedigree. Thus, despite the fundamental 

10 importance of overt phenotypic traits in the breeding process and in the maintenance of 
breed purity, those skilled in the art generally consider that breed identity cannot be 
definitively characterized on the basis of visual inspection of such traits. By way of 
example, the genetic factor causing the belt phenotype in pigs is dominant to the non- 
belted form. Thus, a belted animal may result from an animal of a belted breed such as 

15 Hampshire being crossed with a non-belted breed. 

As stated in PIGS A handbook to breeds of the world, V. porter, Helm Inf. Ltd, ISBN 1- 
873403-17-8, 1993, page 16, "What is a Breed?": 

2 0 Appearances can be deceptive: never judge a pig breed by its coat! 

However, in many circumstances breed identification on the basis of direct evidence of 
pedigree is difficult or impossible. Thus, in practice, so-called "breed markers" may be 
used to determine breed identity. 

25 

The term "breed marker" is a term of art which defines a measurable characteristic 
which on the basis of empirical data appears to be breed specific. Breed markers include 
genotypic features such as DNA polymorphisms, chemical features such as protein and 
water contents of meats, epigenetic/biochemical features (such as protein 
30 polymorphisms), chromosome structure, gene copy number, DNA fingerprinting, 
microsatellite analysis and RAPD DNA markers. 



9854360A1 I > 



JVO 98/54360 PCT/GB98/0I531 

y 

* > 

4 

Other useful markers include breed determinants. The term "breed detenninant" is used 
herein to indicate an overt phenotypic characteristic which is used (at least in part) as 
the basis of artificial selection during breeding programmes. It is used in 
contradistinction to the term "breed marker", which (as explained above) is used herein 
5 to define other characteristics which appear to be breed specific on the basis of 
empirical data. The term "breed determinant gene" is used to indicate a gene which is 
involved (at least in part) in the expression of the corresponding overt phenotypic 
characteristic. 



10 Some breed determinants (e.g. coat colour) have traditionally been used as breed 
"trademarks", and so have long served as an indication of pedigree (and breed identity). 
Other breed determinants that have also been selected for in breed development include 
features such as ear carriage, face shape and general anatomical conformation. The 
advantage of breed determinants relative to simple breed markers is the inseverable link 

15 between the characteristics of the breed and the detenninant 

Biochemical and genetic tests for breed identity 

Many of the breed markers discussed above can be characterized using biochemical or 
genetic tests. Such markers include genotypic features (e.g. DNA polymorphisms), 
2 0 biochemical features (e.g. protein polymorphisms), chromosome structure, gene copy 
number, DNA fingerprints, microsatellite patterns and RAPD DNA markers. 

However, there are significant problems associated with such tests, as discussed below: 

25 Tests based on the chemical composition of animal products (e.g. meat or seminal 
plasma) may be compromised by the fact that the chemical profile varies between sites 
in the animal (ie different muscles)and is affected by diet, age, sex and sample storage 
conditions. Moreover, the results obtained are usually quantitative in nature, leading to 
problems with interpretation and comparison between different test sites. 

30 

Tests based on protein polymorphisms are limited by the fact that the distribution of any 
given protein is unlikely to be uniform, so that the protein of interest is absent in certain 
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tissues. Thus, a number of different polymorphism markers may be required to check 
all products of interest for breed provenance. Moreover, such tests are based on 
antibody assays, and a significant investment is also required to develop the reagents for 
a specific antibody test. 

5 

Chromosome structure analyses are compromised by the high level of skill required for 
cytogenetical methodology and interpretation and the elaborate precautions and care 
required for sample preservation. Such markers are poorly applicable on anything but 
materials derived from living or newly deceased animals. 

10 

Classical DNA fingerprinting is based upon regions of repeated DNA sequence that due 
to their structure show a large degree of variation in length within a population. Such 
regions are often present in a number of copies within the DNA of an individual, thus 
increasing the potential for individual variation. By separating fragments of the total 
15 DNA according to size and then defining the position (and so size) of the hypervariable 
region using a specific probe, a fingerprint of a series of bands for a particular 
individual can be obtained. A number of probes for hypervariable regions of DNA have 
been examined in pigs (including Ml 3 viral sequences and human minisatellite probes) 
and it is claimed that specific bands were found in each breed. 

20 

Random Amplified Polymorphic DNA (RAPD) markers are based upon PCR 
amplification of DNA fragments using primers of random sequence. Such reactions 
generally give rise to a number of DNA fragments which can be characterised 
according to size by gel electrophoresis. If the products of reactions based upon DNA 

2 5 from different breeds are examined there is the possibility of finding certain DNA bands 

which are breed specific. However, there is in most cases no direct link between the 
alleles of such repeat series present and the features determining the actual nature of the 
breed. This, combined with the hypervariable nature of these regions of DNA, results 
in them rarely being breed specific (similar alleles being found in a number of different 

3 0 breeds). As there is no link to the phenotype of the breed there is a greater risk that 

cross specific alleles could exist or arise in a breed, whereas this is unlikely with breed 
determinants as they define the phenotype itself. Given the large number of populations 
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of animals of specific breeds that exist, extensive research would have to be carried out 
to exclude a DNA marker from breeds other than that with which it is claimed to be 
linked. 

5 However, a major drawback with this approach is that RAPD markers are considered to 
be unreliable and found to be subject to variation between laboratories. Such problems 
are exacerbated when samples of different types and history must be analyzed and 
compared. 

1 0 There is therefore a need for reliable breed markers which can be used as the basis for 
rapid and inexpensive methods for identifying the breed provenance of various animal 
products and for validating animal products (such as foodstuffs and semen for use in 
breeding programmes). 

15 It has now been recognized that breed determinants as hereinbefore defined (such as 
coat colour) have unexpected advantages as breed identifiers or breed specific markers. 
In particular, it has surprisingly been discovered that the use of overt phenotypic 
characteristics as the basis for selection over long periods of time has led to particular 
alleles becoming fixed in most breeds. Such breed markers can be used to provide 

2 0 industry standard profiles for a particular breed that has application to all materials 

derived from a particular species. 

Thus, it has now been found that many breeds are in fact genetically homogenous with 
respect to breed determinant genes (as hereinbefore defined), so that these genes may 
25 serve as the basis of reliable breed-specific markers (contrary to the prejudice in the art 
mentioned earlier regarding the utility of breed determinants per se, such as coat colour, 
in breed identification). 

Moreover, it has surprisingly been found that the nature of the breed determinant genes 

3 0 (or alleles thereof) underlying any one breed determinant (such as coat colour) may be 

highly polymorphic. Thus, variation in breed determinant genes and/or alleles between 
different breeds may exist, notwithstanding the fact that the different breed determinant 
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genes/alleles may contribute to the expression of the same overt phenotypic 
characteristic. 

Prior to the present invention, it was assumed that the corresponding genetic 
5 determinants would be insufficiently polymorphic to provide a useful basis for 
distinguishing between breeds. For example, coat colour was known to be shared 
among different breeds of pig and (as mentioned above) was therefore not regarded as a 
good candidate for a breed specific marker. However, the present inventors have found 
that the alleles underlying the coat phenotype in such breeds are in fact highly 
1 0 polymorphic and often distinctive (and so useful as the basis for breed identification). 

Similar considerations apply to other overt physical traits (breed determinants), which 
may therefore be shared by different breeds while nevertheless associated with distinct 
genes/alleles in each breed. An example of this is seen in cattle exhibiting the double 

15 muscled phenotype. Work by Kambadur et alia (1997, Genome Research 7, 910-915) 
and Grobet et alia (1997, Nature Genetics 17, 71-74) illustrates that the double muscled 
phenotype of cattle is caused by mutations in the myostatin gene. However, in the 
Belgian Blue and Asturiana breeds, this gene contains an 1 1 bp deletion whereas in the 
Piedmontese breed a G to A transition is present Thus, as with porcine coat colour a 

2 0 single selected characteristic is caused by a number of potential polymorphisms. 
However, the nature of the arisal and selection history for such overt physical 
characteristics leads to the fixation of particular alleles within the breeds contributing to 
the breed specific profile of determinants. 

25 In the light of these findings, it has now been recognized that genetic analysis of breed 
determinants (such as coat colour) provides an effective means for validating animal 
products (e.g. foodstuffs) and may advantageously be incorporated into animal product 
(e.g. food) processing lines to monitor and maintain product quality and into quality 
control protocols in the food industry. 

30 

Coat colour 

Pig breeds show a variety of coat colours and these are often associated with 
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particular production characteristics. For example, white is the predominant coat 
colour among European commercial breeds e.g. Large White and Landrace, and these 
breeds are associated with larger litters and good mothering ability. However, there 
are a number of commercially important coloured breeds, demonstrating a number of 
5 colours and combinations. The Duroc, associated with meat tenderness, is red, the 
Pietrain, a heavily muscled animal which produces a very lean carcass, is spotted, and 
the Hampshire, also heavily muscled, is black with a white saddle over its shoulders. 
In addition, there may be other useful local breeds which have traits of potential 
commercial interest, and which are coloured. For example, the Chinese Meishan 
10 breed has been imported into Europe and the US because of its very large litter size. 
The European Wild Boar is brown when adult and striped when juvenile, and this 
breed is utilised to satisfy consumer demand for traditional meat products. It is also 
claimed that other local breeds or landraces are important because of their adaptation 
to local environments, e.g. temperature, endemic diseases and local feedstufFs. 

15 

Coat colour is important to the pig industry for a number of reasons. Firstly, gross 
variation in appearance (i.e. a range of coat colours) of pigs claimed to be genetically 
consistent for traits other than coat colour can lead to questions about the consistency 
and quality of the animals in the mind of pig-producers. Thus, the coat colour of the 
2 0 pig is often used as a trademark of the breed and the breeders want to ensure that their 
animals breed true for colour. For example, in several markets, local, traditional, 
coloured breeds are marketed for their meat quality or in terms of the production 
system used to rear them. However, this is not a trivial task since the coat colour is 
controlled by a number of genes. The inheritance is also complicated by the presence 

2 5 of dominance and interaction between genes. There is also an application in the 

assessment of the purity of the genetics of traditional breeds used as the basis for 
modem synthetic lines and the confirmation of the derivation of the latter. 

Secondly, in a number of markets there is a preference for white skinned meat. This 

3 0 is due to the fact that pork is often marketed with the skin still attached, and skin from 

coloured pigs, even if dehaired, can still exhibit colour, which can lead to negative 
perception by the consumer partly, since the surface of the meat may appear to be 



SUBSTITUTE SHEET (RULE 26) 



WO 98/54360 PCT/GB98/01531 

9 

spotted by mould. It is therefore necessary in these markets to remove the skin from 
such carcasses, entailing additional cost. For example, in the US, coloured carcasses 
are associated with approximately 1% skin defects requiring dehairing and skinning to 
remove pigment. As a result of this, coloured pig carcasses are generally discounted. 

5 

One example of the problem concerns the presence of black pigmented spots 
occurring in production animals that are crossbreds between a white and a pigmented 
line. This may occur because the dominant white gene inherited from the white breed 
is not always fully dominant in the heterozygous condition which occurs in this cross. 

1 0 A possible solution to this problem would be to ensure that the production animals 
are homozygous for the recessive red allele present in breeds such as the Duroc. In 
this case the pigmented spots would be red instead of black and much less 
conspicuous. To achieve this one needs to breed the recessive red allele to 
homozygosity in both the white and pigmented line used for cross-breeding. 

1 5 However, this would be very difficult using phenotypic selection as selection for a red 
background colour in a white line could only be accomplished with very expensive 
progeny testing schemes. 

In addition, pig breeders would like to be able to be in a position to ensure consistency 
2 0 in breeding populations. Breeders may wish to ensure that progeny produced by 
breeding crosses were always white. Alternatively, a breeder of Duroc or Hampshire 
pigs may wish to ensure that breeding crosses always produced the characteristic 
Duroc or Hampshire colouring. Traditional animal breeding practices have in the 
past, been used to attempt to eliminate untypical colour from pig lines. For example 

2 5 purebred breeders must submit potential boars for progeny testing in order to 

demonstrate that they are suitable for inclusion in the breeding herd. This procedure 
incurs significant cost, including the substantial delay to confirm sufficient matings 
and progeny have been produced before the animal can be used commercially. 

3 0 Therefore, selection based on a diagnostic DNA test for mutations in coat colour 

genes would be a major advance compared with phenotypic selection. 

Coat colour is determined by the action of a number of different gene loci. For 
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example, the gene determining whether a pig is white or coloured is designated / (for 
inhibition of coat colour). The version of the gene preventing the expression of any 
colour (I) is dominant to that which allows colour to develop (/). Traditional selection 
for white animals has reduced the frequency of/, but it still remains in the population of 
5 white heterozygous carrier animals. Recently, a number of structural differences in the 
alleles of the KIT gene were identified and found to be involved with this aspect of coat 
colour determination which allowed the development of methods of distinguishing 
between alleles at this locus. 

10 However, animals which carry two copies of the recessive allele, i, at this locus have 
non-white coat colours (Johansson-Moller et aL, Mamm. Genome, 7:822-830 (1996), 
WO-A-97/05278, the disclosure of which is incorporated herein by reference). Pigs of 
this type can be all one colour, such as the Duroc (which is red), or have combinations 
of colours (particularly spotted or striped or banded patterns, such as the Pietrain and 

15 Hampshire, respectively). Many other combinations are possible and are observed (see 
the table, below): 



Genotvpe 


Colour 


I/I 


White 


I/i 


White 


i/i 


Coloured 


f/f 


White with coloured spots 


f/I 


White 


m 


White with coloured patches 



The non- white colour in such animals may be varying shades of red or black. The type 
of colour expressed is determined by the action of a second gene which is designated E 
(for extension of coat colour). Based on the literature, animals which contain the E 
version of the gene are completely black, and this version of the gene is dominant to 
3 0 that which results in red coat colour (e). Patched or spotted animals, such as the 
Pietrain breed, contain a third version of the gene designated EF. This version of the 
gene is dominant to e but not to E. For example, black animals may have the 
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genotype, UEe, UEEF or UEE. A black sow and a black boar which were both 
heterozygous at the E locus and which were of the genotype UEe would produce both 
black and red piglets in the ratio 3:1. The black piglets would be UEE or UEe and the 
red piglets would be iiee. 

5 

The density and coverage of coat colour and the position of bands of white are 
determined by additional loci, one of which, the belt locus, is discussed later in this 
application. For example, the Hampshire breed is background black with a white band 
across its shoulders, the width of this band may vary, however, the colour should be 

10 black. There is evidence that some Hampshire animals are derived from herds that 
have been crossed previously with red breeds, such as the Duroc. In this situation, the 
red version of the gene can be maintained silently in the heterozygous state. When 
two heterozygotes are crossed 25% of the offspring will contain red. In some cases 
such pigs will have the appearance of the Duroc breed, being solid red, however, in 

15 other cases, the animals will have the white band inherited from the Hampshire and 
have the appearance of red Hampshires. It is the presence of the atypical coat colour 
rather than the pattern that is important in this situation. 

The extension locus is known in other breeds of domestic animals, such as the horse, 

2 0 where e is associated with chestnut colour (Adalsteinsson, J. Hered 65:15-20 (1974)), 

cattle (Klungland et al, Mammalian Genome 6: 636-639 (1995)), the fox 
(Adalsteinsson, J. Hered. 78:15-20 (1987)) and the mouse (Jackson, Ann Rev Genet. 28: 
1 89-217 (1 994)). The extension locus encodes the alpha melanocyte-stimulating 
hormone receptor (aMSHR). It has been shown that recessive alleles at this locus do 
25 not express a functional aMSH receptor (Robbins et a/, Cell, 72: 827-834, Klungland 
et al 9 Mammalian Genome 6: 636-639 (1995)) and these workers have identified 
mutations in the sequence of the aMSHR gene in these species associated with 
different coat colours. 

3 0 Classical segregation analyses have identified a minimum of three alleles at the pig 

extension locus: E for uniform black, E? for black spotting and e for uniform red 
(Ollivier and Sellier, Ann. Genet Sel Anim., 14:481-544, (1982)). The dominance 
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relationship among the three alleles is as follows E>EF>e. We have now found that 
these coat colour variations are associated with sequence polymorphism in the 
ctMSHR gene in the pig. We have analysed the DNA sequence of this gene using 
samples from the following breeds with different coat colour: Wild Boar which is 
5 wild type coloured, Meishan and Hampshire which carry alleles for uniform black 
(£), Pietrain and Large White which carry alleles for black spotting (EF) and Duroc 
which is uniform red (e). In Large White the patches or spots of colour that might be 
expected due to the presence of the EF allele are hidden as this breed also carries the 
dominant white gene which prevents any expression of colour. Five different aMSHR 

10 sequences were obtained one from the Wild Boar, one from Meishan, one from Duroc 
one from Hampshire, and one found in Pietrain and Large White. We have designated 
the allele found in the Wild Boar as E* and assume that the presence of this allele is 
necessary for the expression of the wild type colour. The E alleles for uniform black 
carried by Meishan and Hampshire pigs were associated with different aMSHR 

15 sequences. We have denoted these two alleles ET and respectively. The DNA 
sequence associated with the allele for black spotting found in the Pietrain was 
denoted^. The similarity of the EF and E? alleles suggests that they are derived from 
a common origin. The sequence differences presented here can be used as the basis of 
methods and kits to determine the genotype of pigs in relation to coat colour. 

20 Alternatively, alleles of linked markers, such as microsatellite or AFLP markers, 

found to be in linkage disequilibrium with these alleles could be used to predict colour 
genotype. In conclusion we have found fivedifferent aMSHR sequences associated 
with five different extension alleles i.e. E + y EF and e. 

25 Except for the 2 base pair insertion at the 5' end of the EF allele and the 1 bp deletion 
in the 3' untranslated region of EF* allele the DNA sequence differences identified in 
the aMSHR gene are single base pair changes. Some of these are silent, however, a 
number lead to changes in the amino acid sequence of the aMSHR protein. For 
example, the differences between e and the other alleles are two missense mutations 

3 0 in the coding sequence of the aMSHR gene. Importantly, the differences in the pig 
gene are different from that found in other species. The cattle and mouse e mutations 
are one base pair deletions (Robbins et al, Cell, 72: 827-834 (1993); Klungland et al y 
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Mammalian Genome 6: 636-639 (1995), Joerg et al, Genome 7: 317-318 (1996)), 
whilst the mutations identified here include a rnissense mutation (G727 changed to A) 
in a region which is conserved among human, mouse, cattle and horse gene sequences 
(Wikberg et ah WO 94/04674 (1994), Valverde et al, Nature Genet. 11: 328-330 
5 (1995), Robbins et at, Cell, 72: 827-834 (1993), , Klungland et al 9 Mammalian 
Genome 6: 636-639 (1995), Joerg et al, Genome 7: 317-318 (1996), Marklund et al 9 
Mamm. Genome, 7:895-899 (1996)), The E? group has a dinucleotide insertion in the 
5' end of the gene after nucleotide position 66 of the Wild Boar sequence which leads 
to the creation of a stop codon further into the gene resulting in a predicted mutant 
10 polypeptide of only 54 amino acids. Finally, the Meishan allele {ET) shows four 
amino acid changes in the protein. Two of these differences are in the same region of 
the gene which is altered in cattle. 

The colours of a series of pig breeds, the classical genotypes for / and E and the 
15 determined genotypes for E based on sequencing and testing studies are shown in the 
table below: 



Breed 


J locus 


E locus 


Colour 


Hampshire 


i/i 


ET/ET 


Black with white belt 


2 0 Large White 


I/I 


EF/EF 


White 


Landrace 


I/I 


EF/EF 


White 


Pietrain 


i/i 


EF/EF 


White with black patches 


Berkshire 


i/i 


EF/EF 


Black with white points 


Meishan 


i/i 


ET/ET 


Black or black with white points 


25 Duroc 


i/i 


e/e 


Red 


Wild Boar 


i/i 


ET/ET 


Brown (banded hair) 



3 0 Thus, it is possible to distinguish between the alleles of E* t ET, E? and e and so 
determine the genotype of individual pigs (or the genetic provenance of products 
derived therefrom) with respect to non-white coat colour. Interestingly, the white 
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breeds that have been examined all appear to be fixed for alleles E? at the E locus. 
There is considered to be potentially some modifying effect of the E locus on the 
phenotype conferred by the / locus. While the basis of this is not established, the fixing 
of EF in these lines illustrates the subtle effects on loci involved with coat colour upon 
5 selection for breed characteristics thus providing more determinants among such loci 
than might be expected. 

Associations can be determined between extension locus genotype and linked markers, 
eg microsatellite sequences which are linked to the gene. A number of microsatellite 
10 markers have been located to the region of porcine chromosome 6 to which the 
aMSHR gene has been mapped. 

A number of pig breeds characteristically show the belt phenotype consisting of a 
continuous white belt over the shoulders and white fore legs. Examples of breeds 
15 demonstrating this characteristic are the British Saddleback (derived from the Wessex 
and Essex breeds) and the Hampshire, which show a white belt upon a black back 
ground and the Bavarian Landschwein characterised by a white belt upon a red 
background. The characteristic is controlled by a dominantly acting locus Belt 
designated Be for which there are thought to be two alleles (Legault 1997 in The 

2 0 Genetics of the Pig, Ed Rothschild M.F. and Ruvinsky A, Publ. CAB International) 

(Ollivier and Sellier, Ann. Genet Sel Anim., 14:481-544, (1982)). Be giving rise to a 
belt and be which in the homozygous form leads to the absence of a belt. The 
heterozygous animal Be/be carries a belt but in this genotype the belt is generally 
narrower in character. 

25 

To identify the actual genetic basis of the belted and non belted phenotype studies were 
carried out using animals from a Pietrain x (Pietrain x Hampshire) cross. The Pietrain is 
be/be while the Hampshire is Be/Be. Thus the Fl generation all have the genotype 
Be/be. Further crossing of the Fl back to the Pietrain (be/be)\eads to the segregation of 

3 0 the Be allele between offspring, giving rise to Be/be animals showing belts and be/be 

non belted offspring. A correlation was then established between the inheritance of the 
belted condition and certain microsatellite markers within these pedigrees. This work 
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surprisingly identified the actual gene involved as the ATT gene also described above as 
involved in dominant white. Further analysis showed correlation of the phenotype in 
this pedigree with a polymorphism at KIT nucleotide 2678 with a C or T occurring at 
this position. The presence of a C creates a restriction site for Aci I which is absent 
5 when a T is present. Based upon these unexpected findings a number of approaches 
can be taken to the determination of the genotype for an animal at the belt locus using 
either single nucleotide polymorphisms or linked markers including microsatellites or 
other single nucleotide polymorphisms. Thus animals can be genotyped by a number 
of approaches to determine their genetic status for this particular overt characteristic. 

10 

Summary of the invention 

According to the present invention there is provided a method for differentiating 
animals and animal products on the basis of breed origin, for determining or testing the 
15 breed origin of an animal product or for validating an animal product, wherein the 
method comprises the steps of: (i) providing a sample of the animal product; and (ii) 
analyzing the allele(s) of one or more breed determinant genes present in the sample. 

As explained above, the breed determinant is an overt phenotypic trait. As used herein, 
2 0 an overt phenotypic trait is one which can be visually recognized. 

Differentiation of animal products on the basis of breed origin involves the partition of 
members of a class of different animal products into a number of different products 
sharing the same breed origin. It does not necessarily imply identification of the nature 

2 5 of the breed source. Animal product differentiation on this kind of basis may be 

sufficient where the consistency of source of animal products must be monitored (but its 
actual breed provenance is not important). 

In contrast, determination of the breed origin of an animal product implies identification 

3 0 of the breed source, while testing the breed origin implies analysis sufficient to 

determine whether a breed source other than that desired has been used (without 
necessarily identifying such other breed sources in cases where they are indicated). 
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Validating an animal product implies confirming that it meets stipulated specifications 
as to breed provenance. Such validation may involve differentiation, determination 
and/or testing, depending on the circumstances under which the analysis is performed 
5 and the nature and extent of ancillary data which may be available. 

The sample for use in the invention may be in any convenient form. In many cases, the 
sample will be a sample of a food (e.g. meat product). For most applications, the 
sample is pre-treated (e.g. extracted, purified and/or fractionated) in such a way so as to 

10 make the alleles of a breed determinant gene or genes available for analysis (either at 
the level of nucleic acids (such as RNA or DNA) and/or proteins). The sample is 
preferably a nucleic acid sample, in which case the analysing step (ii) comprises DNA 
or RNA analysis. Alternatively, the sample may be a protein sample (where the nature 
of the protein reflects a breed determinant allele), in which case the analysing step (ii) 

15 comprises protein analysis. 

The breed determinant of the invention may be a monogenic or polygenic trait. 
Monogenic traits are preferred, since the genes conferring such traits are relatively 
easily identified and analyzed. However, in some cases it may be useful to analyze the 

2 0 alleles of polygenic traits (i.e. traits which are controlled by a plurality of genes), since 

the underlying allele polymorphism is often greater in such cases (so increasing the 
potential for breed differentiation). 

Typically, overt phenotypic traits are those traits which have been used as the basis for 
25 artificial selection during the breeding programme. The overt phenotypic trait is 
preferably a behavioural or morphological, physiological or behavioural trait. 

The overt phenotypic trait may vary qualitatively or quantitatively between breeds. 
Preferred are traits which vary qualitatively between breeds, since such traits are often 

3 0 reflected by qualitative differences in the alleles of the corresponding breed determinant 

gene(s). In such cases, analysis yields relatively robust positive-negative results, which 
are easily interpreted and compared between testing stations/laboratories. 
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The breed determinant gene analysed in step (ii) may be any suitable breed determinant 
gene. Such genes may be identified and analysed by methods well known in the art 
using routine trial and error. Preferably, they are selected from any of a coat colour, 
5 pattern, texture, density or length gene; an ear aspect gene; a double muscling gene; a 
horn morphology gene; a tusk morphology gene; an eye colour gene; a plumage gene; a 
beak colour/moiphology gene; a vocalization (e.g. barking) gene; a comb or watde 
gene; and/or a gene controlling display behaviour. 

10 In preferred embodiments, the breed determinant gene is the KIT or aMSHR coat 
colour gene (for example, the pig KIT and/or aMSHR gene). 

The analysis step (ii) may comprise any of a wide range of known nucleic acid/protein 
analytical techniques. The nature of the analytical technique selected is not critical to 
1 5 the practice of the invention, and those skilled in the art can readily determine the 
appropriate technique according to the circumstances in which the analysis is to be 
conducted and the type of data required. 

Preferably, the analysis step (ii) comprises selectively amplifying a specific fragment of 
2 0 nucleic acid (e.g. by PCR), testing for the presence of one or more restriction 
endonuclease sites within the breed determinant gene(s) (e.g. restriction fragment length 
polymorphism (RFLP) analysis), determining the nucleotide sequence of all or a portion 
of the breed determinant gene(s), probing the nucleic acid sample with an allele-specific 
DNA or RNA probe, or carrying out one or more PCR amplification cycles of the 

2 5 nucleic acid sample using at least one pair of suitable primers and then carrying out 

RFLP analysis on the amplified nucleic acid so obtained. 

Alternatively, the analysis step (ii) comprises probing the protein sample with an 
antibody (e.g. a monoclonal antibody) specific for an allele-specific epitope, 

3 0 electrophoretic analysis, chromatographic analysis, amino-acid sequence analysis, 

proteolytic cleavage analysis or epitope mapping. For example the E? allele might be 
distinguished by any method capable of detecting an alteration in the size of the 
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encoded protein. 

In particularly preferred embodiments, the analysis step (ii) comprises determining the 
nucleotide sequence of the KIT and/or aMSHR gene or the amino acid sequence of the 
5 KIT and/or aMSHR protein. Here, the analysis may comprise establishing the presence 
or absence of at least one mutation in the KIT and/or aMSHR gene. Any method for 
identifying the presence of the specific sequence change may be used, including for 
example single-strand conformation polymorphism (SSCP) analysis, ligase chain 
reaction, mutagenically separated PCR RFLP analysis, heteroduplex analysis, 
10 denaturing gradient gel electrophoresis, temperature gradient electrophoresis, DNA 
sequence analysis and non-gel based systems such as TaqMan™ (Perkin-Elmer). 

In the TaqMan™ system, oligonucleotide PCR primers are designed that flank the 
mutation in question and allow PCR amplification of the region. A third 
15 oligonucleotide probe is then designed to hybridize to the region containing the base 
subject to change between different alleles of the gene. This probe is labelled with 
fluorescent dyes at both the 5' and 3' ends. These dyes are chosen such that while in this 
proximity to each other the flourescence of one of them is quenched by the other and 
cannot be detected. Extension by Taq DNA polymerase from the PCR primer 
20 positioned 5' on the template relative to the probe leads to the cleavage of the dye 
attached to the 5' end of the annealed probe through the 5' nuclease activity of the Taq 
DNA polymerase. This removes the quenching effect allowing detection of the 
florescence from the dye at the 3' end of the probe. The discrirnination between 
different DNA sequences arises through the fact that if the hybridization of the probe to 

2 5 the template molecule is not complete (i.e. there is mismatch of some form), then 

cleavage of the dye does not take place. Thus only if the nucleotide sequence of the 
oligonucleotide probe is completely complementary to the template molecule to which 
it is bound will quenching be removed. A reaction mix can contain two different probe 
sequences each designed against different alleles that might be present thus allowing the 

3 0 detection of both alleles in one reaction. 

Although the TaqMan™ system is currently capable of distinguishing only two alleles, 
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labelled probe primer sets could be developed in which the probes for certain target 
allele(s) are labelled with a different fluorescent dye from non target alleles. For 
example, if one wished to confirm that a group of Duroc breed pigs carried only allele e 
one could have a probe present capable of detecting this allele labelled with one 
5 fluorescent dye and probes capable of detecting all the other alleles labelled with the 
second dye. Thus one would detect the presence of any non Duroc type alleles at this 
locus. Such probe sets could be designed and labelled according to the needs of the 
experiment. 

1 0 The analysis step (ii) may further comprise determining the association between one or 
more microsatellite marker alleles linked to the KIT and/or aMSHR gene and to 
particular alleles of the KIT and/or aMSHR gene. 

Alternatively, the analysis step (ii) may be based on the identification of microsatellite 
15 markers present in the nucleic acid sample. 

The analysis step (ii) preferably comprises: (a) determining the association between one 
or more microsatellite marker alleles linked to the KIT and/or aMSHR gene and to 
particular alleles of the KIT and/or aMSHR gene; determining which microsatellite 
20 marker allele or alleles are present in the nucleic acid sample. 

The analysis step (ii) preferably further comprises the step of determining the genotype 
of at least one additional locus, for example an additional breed determinant (e.g. coat 
colour) locus. Particularly preferred as an additional locus is the ATT gene locus (e.g. the 
25 pig KIT gene locus). 

The analysis step (ii) preferably comprises PCR using at least one pair of suitable 
primers. In the case where the gene is the pig aMSHR gene, the at least one pair of 
suitable primers is: 

30 

aMSHR Forward Primer 1: (5'-TGT AAA ACG ACG GCC AGT RGT GCC TGG 
AGGTGT-3') 
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aMSHR Reverse Primer 5: (5'-CGC CCA GAT GGC CGC GAT GGA CCG-3'); or 
aMSHR Forward Primer 2: (5*-CGG CCA TCT GGG CGG GGA GCG TGC-30 
aMSHR Reverse Primer 2: (5'-GGA AGG CGT AGA TGA GGG GGT CCA-3'); or 
aMSHR Forward Primer 3: (5'-GCA CAT CGC CCG GCT CCA CAA GAC-3*) 
5 aMSHR Reverse Primer 3: (5*-GGG GCA GAG GAC GAC GAG GGA GAG-3') 

The analysis step (ii) may also comprise restriction fragment length polymorphism 
(RFLP) analysis, for example involving digesting the pig nucleic acid with one or more 
of the restriction enzymes BstUl, Hhal and/or BspHL. In cases where the gene is the pig 

10 aMSHR gene, this analysis may involve identification of a polymorphism at any of the 
nucleotide positions shown to be polymorphic including 283, 305, 363, 370, 491, 727, 
729, 1162 or between nucleotide positions 60 and 70 or between nucleotide 
positonsl005 and lOlOof the pig aMSHR gene. 

15 The analysis step (ii) may involve carrying out one or more PCR amplification cycles of 
the nucleic acid sample using at least one pair of suitable primers and then carrying out 
RFLP analysis on the amplified nucleic acid so obtained to determine the KIT or 
aMSHR genotype of the pig. Here, when the gene is the pig aMSHR gene the at least 
one pair of suitable primers is as defined above. 

20 

The animal product preferably comprises or consists of meat (e.g. processed and/or 
canned meat), egg, egg swab or washing, semen, blood, serum, sputum, wool, biopsy 
sample or leather. It may comprise genomic DNA, RNA or mitochondrial DNA. 

25 The animal is preferably a mammal (e.g. pig, cattle, dog, cat, horse, sheep, rodent or 
rabbit), fish (e.g. salmon or trout) or bird (e.g. chicken or turkey). 

The invention may be used with extremely small samples and can be used to screen 
large numbers of samples quickly and inexpensively. The invention may be adapted to 
30 yield absolute results, and quantification is not essential. Moreover, only small 
fragments of nucleic acid are required, and the same tests can be used on the majority of 
animal products. 
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Applications 

TTie invention finds application in a number of areas. For example, certain breeds are 
considered to yield meat of higher eating quality, and a number of retailers now market 
5 products which claim to be derived from specific or traditional breeds (for example, 
Wild Boar crosses). The invention enables consumer organisations to validate these 
claims and also permits retailers to monitor the quality of the products with which they 
are being supplied (i.e. perform product validation). The invention finds particular 
application in validation studies carried out and used by retailers to support consumer 
10 confidence, since the linkage between a genetic marker and an overt physical feature is 
more readily grasped by the lay person than the concept of breed specific markers. This 
makes the use of such breed determinants attractive and also offers marketing 
opportunities for retailers to underpin validation schemes. 

15 There are also a number of reports of breed influences on the quality of hams produced 
by various meat processing techniques. For example, in one report hams from three 
different pig breeds were reliably classified on the basis of sensory descriptors of 
marbling, saltiness and dry cure flavour. The breed identification processes of the 
invention enables producers to validate raw materials as part of quality control. 

20 

The ability to enforce and validate raw material source uniformity also yields improved 
process control, lower costs and greater product consistency, since it has now been 
found that heterogeneity in chemical composition of products from different breeds is 
an important factor in flavour profile variation and there may also be differences in the 
25 functionality of other meat components between breeds. 

The invention also finds utility in the maintenance of stock purity by animal (e.g. pig) 
breeders. The small size of traditional breed populations means that the maintenance of 
a gene pool of sufficient size to avoid the effects of inbreeding requires the importation 
3 0 and movement of stock between separate populations. A risk of genetic contamination 
is associated with such movements, and the invention may be used to reduce or 
eliminate these risks. The maintenance of biodiversity and the rare breeds providing the 
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reservoir for this diversity provides an increasing need for breed identifiers. There is for 
example a problem for breeders of the British Saddleback. Certain bloodlines of this 
breed carry a higher frequency of the be allele of the belt locus which can result in the 
production of belt-less animals which do not reach the required breed standard and 
5 decrease the value not only of that individual animal but of the whole Utter. The ability 
to select against this allele when new bloodlines are introduced to an existing 
population would enable breeders to increase the genetic diversity with out the risk of 
lowering the relative standard of the particular population to that of the breed in general. 

10 The invention may also be used as part of a breeding programme to confirm particular 
crosses. This may be of enormous value in the establishment of pyramid breeding 
schemes. Particular breed characteristics such as coat colour, body shape and ear aspect 
are often altered in such crosses, yet there is a need to be able to confirm the presence of 
genetics of the desired parents. 

15 

Such visible breed characteristics for the visible confirmation of crosses are also absent 
in the use of artificial insernination, where semen may be supplied from pigs in distant 
geographical locations. 

2 0 In addition, the skilled person will appreciate that based on the information described 

herein, it is possible to provide tests for detemiining pig genotype, with respect to coat 
colour. Thus, the present invention also provides a method of detennining the coat 
colour genotype of a pig which comprises: 

25 (i) obtaining a sample of pig nucleic acid; and 

(ii) analysing the nucleic acid obtained in (i) to determine which allele or 
alleles of the ctMSHR gene are present. 

3 0 In one embodiment of this aspect of the invention the determination in step (ii) is 

carried out by deterrruning the nucleotide sequence of the ctMSHR gene and, in 
particular, is based on determining which missense, insertion or deletion mutation is 
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present in the coding region of the gene. 



In another embodiment one could first determine the association between 
microsatellite or other linked marker alleles linked to the aMSHR gene and particular 
5 alleles of the aMSHR gene. Thus, the determination in step (ii) would be based on 
identification of microsatellite marker alleles present in the nucleic acid sample. 

In a further aspect, therefore, the present invention provides a method of determining 
the coat colour genotype of a pig which comprises: 

10 

(i) determining the association between one or more microsatellite or 
other linked marker alleles linked to the aMSHR gene and particular 
alleles of the aMSHR gene; 

15 (ii) obtaining a sample of pig nucleic acid; and 

(iii) analysing the nucleic acid obtained in (ii) to determine which 
microsatellite or other linked marker allele or alleles are present 

20 The determination of the alleles at the extension locus will indicate the background 
colour of the animal and in some cases the pattern of mixed colouration, i.e. spotting, 
but will not necessarily determine the coat colour of resulting progeny. This will be 
dependent on the genotype at other loci such as the dominant white locus, /. The 
genotype at the / locus can be determined separately as described in WO-A-97/05278. 

25 

Thus, suitably, the methods as described above may further comprise the step: 



9854360A1 ! > 



WO 98/54360 



PCT/GB98/0fr531 



24 

(ui)/(iv) determining the genotype of additional coat colour loci. 

An example of such an additional coat colour locus is the belt locus. 

In a preferred method PCR is carried out using primers that amplify a region of the KIT 
gene containing nucleotide 2678. An example of a suitable pair of primers is: 

forward primer 

LA93 5' - GAGCAGCCCCTACCCCGGAATGCCAGTTGA -3' 
and the reverse primer 

KJT56 5' - CTTTAAAACAGAACATAAAAGCGGAAACATCATGCGAAGG - 

The method of analysis enables deterrnination of the presence of a C or T at position 
2678. Suitably, the restriction enzyme Aril can be used since the presence of a C 
creates a restriction site which is absent when a T is present Similar examinations 
within a pedigree of will allow the determination of the genotype of offspring. 

Thus, in additional aspect, the present invention provides A method of determining the 
coat colour genotype of a pig which comprises: 

(i) obtaining a sample of pig nucleic acid: and 

(ii) analysing the nucleic acid obtained in (i) to determine whether the KIT 
gene carries any polymorphism associated with Belt genotype. 



on a 



Preferably, the method comprises RFLP analysis which is suitably carried out 
sample of pig genomic DNA which has been amplified using PCR and a pair of 
suitable primers. 

Preferred methods for identifying the presence of the specific sequence change are 
described above in relation to breed determinants. 
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Brief description of the Figures 
Figure 1 Partial nucleotide sequence (a) and the derived amino acid sequence (b) of 
the porcine aMSH-R gene as determined from a number of pig breeds. Position 
numbers for the nucleotide sequence are based upon nucleotide 1 being the A of the 
ATG initiation codon. Numbers of the amino acids are in accordance with the bovine 
BDF3 sequence (Vanetti et al. FEBS Lett. 348: 268-272 (1995)) to allow comparison. 

Figure 2 : Agarose gel electrophoresis of DNA fragments obtained by digestion of 
DNA fragments amplified from the porcine aMSH-R gene with BstUl or Hhal. Lanes 
labelled M contain DNA markers of 50, 150, 300,500, 750, lOOObp. The other samples 
were derived from: 

1. Pietrain 

2. Pietrain 

3. Large White 

4. Large White 

5. Large White 

6. Duroc 

7. Duroc 

8. Hampshire 

9. Meishan 

10. Berkshire 

11. Berkshire 

Figure 3 : Agarose gel electrophoresis of DNA fragments obtained by digestion of 
DNA fragments amplified from the porcine aMSH-R gene with BstUl (lanes labelled 
B) or Hhal (lanes labelled H). Lanes labelled M contain DNA markers of 50, 150, 
300,500, 750, lOOObp. The other samples were derived from: 

1. Retailer 1. Skin 

2. Retailer 1. Fat 

3. Retailer 1. Muscle 

4. Retailer 2. Fat 
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5. Retailer 2. Muscle 

Figure 4: Electropherogram (4% agarose) showing RT-PCR products of KIT exon 16- 
19 with the primers KIT IF and KIT7R. The samples 1-3 and 4-6 are Swedish Large 
White and Hampshire pigs respectively. The size difference between the 424 and 301 
bp fragments is due to lack of exon 17 in the latter fraction. The two upper bands of the 
Yorkshire pigs were inteipreted as heteroduplexes (HD). 

Figure 5: A 48 bp sequence is shown comprising 21 bp of KIT exon 17 and 27 bp of 
KIT intron 17. The position of the intron/exon border is marked with a vertical line and 
the splice site mutation (ntl °~* A ) indicated with a vertical arrow. Identical bases in 
alleles f and i are marked with a dot 

Figure 6: Main PCR RFLP test used to detect the presence of a splice site mutation in 
intron 17 of the KIT gene. Figure 6 A shows the position of two Mam recognition sites 
within the PCR product amplified using primer pair KTT21 and KIT35. All distances 
are given in base pairs. Figure 6B shows the size of fragments which result following 
MaHI digestion of either normal KIT or splice mutant KIT, Figure 6C illustrates use of 
the PCR RFLP test. Lane 1 shows the KIT21/KIT35 amplified fragment undigested. 
Digestion was performed on PCR products amplified from, in Lane 2: a clone which 
contains the splice site mutation; Lane 3: a clone which contains the normal splice site 
sequence; Lane 4: genomic DNA from a coloured pig; Lane 5: genomic DNA from a 
white pig. Fragment sizes are given in base pairs. 

Figure 7: Comparison of the ratio of normal to splice mutant KIT for three classes of 
genotype. 

Figure 8: Comparison of the ratio of normal to splice mutant KIT for two breeds of pig. 

Figure 9: SSCP analysis of the KIT gene in Swedish Landrace (lanes 1-8) and Wild 
Boar (lanes 9 & 10) breeds. The two polymorphic bands are indicated. 
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Fieure 10 : Nucleotide sequence of the porcine KIT cDNA from an animal of the 
Hampshire breed. The sequence is numbered with the first nucleotide of the N terminal 
methionine codon taken as 1 . 

Figure 11 : Polyacrylamide gel electrophoresis of PCR-RFLP analysis of KIT gene at 
polymorphic nucleotide 2678 in a number of animals. Lanes: 1 & 2, Hampshire Wild 
Boar respectively, both homozygous for the C at position 2678. Lanes 3-7 and 9 & 10, 
unrelated Large White sows all homozygous for T at position 2678. Lane 1 1, a Pietrain, 
homozygous for T at this position and lane 8 a Large White sow heterozygous for C 
and T. Lane 12 contains undigested PCR product and lane M DNA size standards. 

Figure 12 

Nucleotide sequence of the 3' end of the porcine aMSHR coding region and adjacent 
3* untranslated region. The TGA stop codon is highlighted in bold, the primer binding 
sites for EPIG14 is shown in italics. Numbering is based on the system used in figure 
la in which nucleotide 1 is the A of the ATG initiation codon of the Wild Boar 
sequence. Bases in common with the European Wild Boar are marked with a dash. 
Missing bases are marked with a :. 

Examples 

Example 1 : Determination of the sequence of the aMSHR gene 

The DNA sequence of the porcine aMSHR gene was determined through the DNA 

sequencing of a combination of PCR products and cloned portions of porcine DNA. 

Preparation of template DNA for PCR 

DNA can be prepared from any source of tissue containing cell nuclei, for example 
white blood cells, hair follicles, ear notches and muscle. The procedure here relates to 
blood cell preparations; other tissues can be processed similarly by directly 
suspending material in K buffer and then proceeding from the same stage of the blood 
procedure. The method outlined here produces a cell lysate containing crude DNA 
which is suitable for PCR amplification. However, any method for preparing 
purified, or crude, DNA should be equally effective. 

Blood was collected in 50mM EDTA pH 8.0 to prevent coagulation. 50jxl of blood 
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was dispensed into a small microcentrifuge tube (0.5ml Eppendorf or equivalent). 
450jal of TE buffer was added to lyse the red blood cells (haem groups inhibit PCR) 
and the mix vortexed for 2 seconds. The intact white and residual red blood cells 
were then centrifiiged for 12 seconds at 13,000 g in a microcentrifuge. The 
supernatant was removed by gentle aspiration using a low pressure vacuum pump 
system. A further 450jxl of TE buffer was then added to lyse the remaining red blood 
cells and the white blood cells collected by centrifugation as before. If any redness 
remained in the pellet, this process was repeated until the pellet was white. After 
removal of the last drop of supernatant from the pelleted white blood cells, 100|j.l of K 
buffer containing proteinase K was added and the mixture incubated at 55 degrees C 
for 2 hours. The mixture was then heated to 95-100 degrees C for 8 minutes and the 
DNA lysates stored at -20 degrees C until needed. 

Reagents 

T.E. Buffer: 1 OmM TRIS-HC1 pH8.0 

ImMEDTA 

K Buffer: 50mM KC1 

1 OmM TRIS-HC1 pH8.3 
2.5mM MgC12 
0.5% Tween 20 

PCR to produce DNA sequencing template 

The aMSHR gene was amplified for sequence analysis using three primer pairs. 

Primers MSHR Forward Primer 1 : (5'-TGT AAA ACG ACG GCC AGT RGT GCC 
TGG AGG TGT CCA T-3'); and 

MSHR Forward Primer 5: (5'-CGC CCA GAT GGC CGC GAT GGA CCG-3') 
amplify a 428 bp fragment from the 5' half of the gene. 
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Primers MSHR Forward Primer 2: (5'-CGG CCA TCT GGG CGG GCA GCG 
TGC-3'); 

and aMSHR Reverse Primer 2: (5'-GGA AGG CGT AGA TGA GGG GGT CCA-3') 

amplify a 405 bp fragment the 3' half of the gene. 

As these two fragments are non-overlapping a third primer pair 

aMSHR Forward Primer 4 (5'-TGC GCT ACC ACA GCA TCG TGA CCC TGC-3'); 

and 

aMSHR Reverse Primer 4 (5'-GTA GTA GGC GAT GAA GAG CGT GCT-3*) 

were used to amplify a 98 bp fragment which spans the 50 bp gap. PCR was carried 
out on a DNA thermal cycler (Perkin Elmer 9600) in a total volume of 20 pi 
containing 25 ng genomic DNA, 1.0 mM MgC12, 50 mM KC1, 10 mM Tris-HCl, pH 
8.3, 200 (M dNTPs, 0.5 U AmpliTaq Gold (Perkin Elmer) and 10 pmol of both 
forward and reverse primer. To activate AmpliTaq Gold, initial heat denaturation was 
carried out at 94 degrees C for 10 minutes followed by 32 cycles each consisting of 45 
sec at 94 degrees C, 45 sec at 53 degrees C and 45 sec at 72 degrees C. The final 
extension lasted for 7 min at 72 degrees C. PCR products were cloned into vector 
pUCl 8 using the SureClone ligation kit (Pharmacia). 

Preparation of plasmid DNA 

Plasmid DNA was purified from overnight bacterial culture using the Jets tar plasmid 
midi kit 50 (Genomed) and the resulting DNA diluted to 150 ng/^1. 

Sequencing of plasmid DNA 

Cloned plasmid inserts were sequenced using dye primer chemistry. Each cycling 
reaction was prepared with template and ready reaction mix containing fluorescently 
labelled Ml 3 forward or reverse primer as described in the ABI Prism protocol P/N 
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4021 13 (Perkin Elmer). Cycling and sample pooling was performed using a Catalyst 
800 Molecular Biology Workstation (ABI) following the instruments user manual 
(Document number 903877, Perkin Elmer). The resulting extension products were 
purified, loaded and analysed using the 377 ABI Prism DNA sequencer as described 
by the instrument protocol (Perkin Elmer protocol P/N 402078). 

Dve Ter minator Sequencing of PCR products 

Dye terminator DNA sequencing requires purification of PCR product free from 
excess dNTPs and residual primers. This was achieved by passage of the template 
DNA through QiaQuick spin columns (Qiagen) before the purified DNA was diluted 
to 1 5 ng/ul. Dye terminator cycle sequencing was performed using AmpliTaq DNA 
polymerase FS in accordance with the ABI Prism protocol P/N 402078 (Perkin 
Elmer). Cycle sequencing reactions were performed in a total reaction volume of 10 
til. This comprised 1 .6 pmole of either the forward or reverse primer used to amplify 
the target fragment from genomic DNA, 20 ng of purified template DNA and 
terminator ready reaction mix (Perkin Elmer) which contains each of four dye 
terminators, dNTPs, Tris-HCl (pH 9.0), MgCl 2 , thermal stable pyrophosphate and 
AmpliTaq DNA polymerase FS. Cycle sequencing was performed with a GeneAmp 
9600 machine (Perkin Elmer) over 25 cycles, each consisting of 10 sec at 96 degrees 
C, 5 sec at 50 degrees C and 4 min at 60 degrees C. Extension products were purified 
for gel separation using ethanol precipitation, loaded and run on a 377 ABI Prism 
DNA sequencer as described by the instrument protocol (Perkin Elmer protocol P/N 
402078). 

Results 

The partial coding region DNA sequence of the porcine aMSHR gene sequence fiom 
a number of pig breeds is given in figure 1 a combined with sequence determined in 
example 22. The derived amino acid sequence is shown in figure lb. 

Example 2: PCR-RFLP based discrimination of alleles at the E locus 
DNA prep aration for PCR 
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PCR 

Reactions were set up in a 20(il reaction volume in thin walled 0.25ml tubes (Perkin 
Elmer) with the following components: 

20|al reaction volume: 
2[il template DNA 
1.5mMMgCl 2 
200nM each dNTP, 

3pM each of forward and reverse primers 
0.5 U AmpliTaq Gold (Perkin Elmer) 

MSHR Forward primer 3 sequence: 5' GCA CAT CGC CCG GCT CCA CAA GAC 
3' 

MSHR Reverse primer 3 sequence: 5 f GGG GCA GAG GAC GAC GAG GGA GAG 
3' 

The reaction tubes were placed on a Perkin Elmer 9600 thermal cycler preheated to 94 
degrees C and PCR carried out according to the regime below: - 
Initial denaturation step of 94 degrees C for 10 min. 
33 cycles: 94 degrees C - 45 sees 
53 degrees C - 45 sees 
72 degrees C - 45 sees 
The last cycle is followed by a final elongation of 72 degrees C for 7 min. Samples 
are stored at 4 degrees C until required. 

Restriction Enzyme Digestion and Electrophoresis 

The PCR amplification product is 148bp in length. To test for polymorphism in the 
amplified products the reaction is split into two aliquots of lO^il each of which is 
digested with Hhal (GIBCO-BRL) or BsfUl (New England Biolabs). The reactions 
are set up and incubated as below: 
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BstVl digest Hhal digest 

10pJ amplified DNA 10 M I amplified DNA 

2.5p, BstUl 2.5v. Hhal 

60 degrees C 60 minutes 0.5ul 1 Ox React 2 buffer (GIBCO-BRL) 

37 degrees C 60 minutes 

Following digestion, 2ul of loading dye is added to each reaction (lOOmM Tris 
PH8.0, lOOmM Boric Acid, ImM EDTA, 50% (v/v) glycerol, 0.02% w/v Orange G) 
and the mixes loaded on a 4% agarose gel (3% NuSieve/1% Seakem, FMC 
Byproducts) in 0.5x TBE (44.5mM Tris P H8.0, 44.5 mM boric acid and 0.5mM 
EDTA) and electrophoresed for 1 hour at 150v. 

Products are visualised by ethidium bromide staining. 
Results 

BstUl and Hhal digestion each result in bands of 61 and 87bp. The relationship of 
digestion to the possible allele is as shown in the table below: 

Relationship of restriction dipest profile, tn inHi^ dual a iie^ a t tu„ f i^.. 

Allele Digestion with BstVl Digestion with Hhal 

E+/EF/& Yes ^ 

No 



Yes 



No No 

If the uncut alleles are designated as allele 1 and the alleles digesting with each 
enzyme as allele 2 the various genotypes will be as shown in the table below: 
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Actual E genotypes and associated scores 



Genotype 


BstUl 


Hhal 


EF'/EF 


1/1 


2/2 


ET/EF or ET/E* 


1/2 


2/2 


ET/e 


1/1 


1/2 


EF/EF or Ef/Ef or ET/E? 


2/2 


2/2 


Ef/e or E*/e 


1/2 


1/2 


e/e 


1/1 


1/1 



Note: The results for animals carrying the allele E* will be the same as those carrying 
EF. 

Samples were prepared from a number of pigs and tested according to the above 
protocol. The results are shown in the table below and figure 2 illustrates the patterns 
seen upon electrophoresis. 

E genotypes determin ed for a range of breeds using the BstUUHhal digestion system 



Breed 


No Tested 


Genotype (see note 1) 


aMSHR type 








BstUl 


Hhal 


Hampshire 


9 


EF/EF 


2/2 


2/2 


Large White 


4 


EF/EF 


2/2 


2/2 


Landrace 


1 


EF/EF 


2/2 


2/2 


Pietrain 


3 


EF/EF 


2/2 


2/2 


Berkshire 


2 


EF/EF 


2/2 


2/2 


Bazna 


4 


EF/EF 1 


1/2 


2/2 






EF'/ET 


1/1 


2/2 


Duroc 


4 


e/e 


1/1 


1/1 


Meishan 


3 


Ef"/E m 


1/1 


2/2 



M>fe /. Thegenotype cannot be distinguished from E* or EF in this particular test. 
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As can be seen from the results above the genotypes determined fit with those 
expected from the sequencing data given in figure la for Hampshire, Large White, 
Meishan and Duroc. The additional breeds typed here show the genotypes expected 
from their phenotype and descriptions in published literature (Ollivier and Sellier, 
Ann. Genet. Sel. Anim., 14: 481-544, (1982)). The Pietrain is a white breed with 
black patches of varying extent and has long been considered to be EF (in agreement 
with the result here). The Berkshire, originally a spotted breed, is now a mainly black 
animal with white 'socks 1 again generally considered to be EF as was found here. The 
Landrace is a white animal due to it carrying the dominant white allele at the / locus, 
however its genotype at the E locus has been shown to be EF from classical breeding 
studies. Once again this is in agreement with the results obtained here. The Bazna is 
a Romanian breed having black base colour with a white belt. It was developed from 
the Berkshire and Mangalitza, a Hungarian breed with a number of colour variations 
including black (Porter, Pigs, a handbook to breeds of the world, publ: Helm 
Information, ISBN 1-873403-17-8 (1993)). The ancestory of the Bazna being based 
upon a black breed potentially carrying a similar allele to the Meishan, ET, and the 
Berkshire carrying EF, is in agreement with the alleles found to be present in the breed 
in this work. 

Example 3: Validation of source breeds of retail meats 
DNA preparation 

DNA was prepared from different parts of pork chops from two separate retailers. 
The DNA was prepared from skin (1 retailer only), fat and muscle using the Promega 
Wizard Genomic DNA preparation kit according to the manufacturers instructions. 
Approximately 4mm 3 of each tissue was cut into small fragments for the extraction. 

PCR and Restriction Digest Analysis 

This was carried out exactly as in example 2. 



WO 98/54360 



PCT/GB98/01531 



35 

Results 

The results are shown in figure 3. It can be seen that DNA extracted from a range of 
tissue types can be utilised for this DNA based test with results being obtained here 
for muscle, fat and skin. The genotype of the pig with regard to the (MSHR gene can 
then be determined. In this case the material from both retailers was derived from an 
animal of test type BstUl 1/2 and Hhal 1/2 using the nomenclature as in example 2. 
This translates into genotype EF/e or E*/e. Based on our current knowledge of the 
distribution of the alleles in commercial pig breeds the conclusion can be drawn that 
both source animals contain genetic material derived from the Duroc. 

Example 4: Validation of source breeds of processed meat samples 

Method 

DNA was prepared from heat treated meat samples according to the method of Meyer 
et aL (Journal of AOAC International, 78 1542-1551). Meat samples were minced 
with a scalpel and 0.3g transferred to a sterile 1 .5ml eppendorf tube containing 430|±1 
of extraction buffer (lOmM Tris-HCl pH 8.0, 150mM NaCl, 2mM EDTA, and 1% 
w/v sodium dodecyl sulphate). Fifty microlitres of 5M guanidine hydrochloride and 
20fil of 20mg/ml proteinase K (Boehringer) were added and mixed by inversion 
followed by incubation at 57°C for 3h. After digestion samples were centrifuged for 
10 min at 13,000 x g, and 450^1 of the aqueous phase added to 1ml Wizard DNA 
purification resin (Promega). The mixture was mixed by gentle inversion and 
following the Wizard DNA clean-up procedure carried out according to the 
manufacturers instructions the purified DNA was eluted with 50pl of 70°C water. 1 \A 
of a 1 : 10 dilution was then used as template in a 10|il PCR. 

PCR was carried out as described in the previous example. 

Results 

Meat samples from a Large White based line and a Duroc based line heated at 80°C 
for 30 mins could be differentiated on the basis of their genotype at the E locus with 
the Large White samples giving a pattern characteristic of the E? allele and the Duroc 
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samples a pattern characteristic of the e allele. 
Example 5: Validation of source breeds of semen 

Genomic DNA was isolated from porcine semen, lml of semen was centrifuged for 2 
min at 13,500 x g and the supernatant removed. 1 ml of 2xSSC was added and the 
mix vortexed to resuspend the sperm. The mix was then centrifuged as before and the 
supernatant removed. 400^1 of 0.2M NaOAc pH 7.0 was added and the mix vortexed 
followed by the addition of 34^1 of B-mercaptoethanoI. The mixture was incubated at 
40°C for 30 min followed by the addition of lOO^il of 10% w/v sodium dodecyl 
sulphate and 50|il of 15 mg/ml Proteinase K (Boehringer) and further incubation at 
40°C for 3 hours. 500^1 phenol equilibrated with Tris-HCl pH 8.0 was added and the 
mix vortexed twice followed by centrifiigation at 13,500 x g for 4 min. 400^1 of the 
aqueous phase was removed and 800jil of ethanol added. DNA was allowed to 
precipitate for 5 min at room temperature followed by centrifiigation at 13,500 x g for 
5 min. The pellet was washed with 800fil 70% ethanol v/v and air dried followed by 
resuspension in 200fil of Wizard DNA resuspension buffer (Promega). 1 \il of a 1/1 0 
dilution was used in a 10|il PCR 

PCR was carried out as described in example 2. 

Results 

Semen form a Hampshire based line and a Duroc based line could be differentiated on 
the basis of their genotype at the E locus with the Hampshire samples giving a pattern 
characteristic of the allele and the Duroc samples a pattern characteristic of the e 
allele. 

Example 6: Discrimination of allele E* from alleles EF/E? 
DNA preparation 

DNA was prepared as described in example 1. 
PCR 
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Reactions were set up in a 20pl reaction volume in thin walled 0.25ml tubes (Perkin - 
Elmer) with the following components: 

lOjil reaction volume: 
2\d template DNA 
2.5 mM MgCl 2 
200nM each dNTP, 

5pmol each of forward and reverse primers 
0.5 U AmpliTaq Gold (Perkin Elmer) 

Forward primer sequence: 5 5 CTG CCT GGC CGT GTC GGA CCT G 3' 
Reverse primer sequence: 5' CTG TGG TAG CGC AGC GCG TAG AAG 3' 

The reaction tubes were placed on a Strategene Robocycler and PCR carried out 
according to the regime below: - 
Initial denaturation step of 94°C for 10 min. 
30 cycles: 94°C - 60 sees 

61°C - 60 sees 

72°C - 60 sees 

The last cycle is followed by a final elongation of 72°C for 7 min. Samples are held at 
6°C until required. 

Restriction Enzyme Digestion and Electrophoresis 

The PCR amplification product is 228 in length. To test for polymorphism in the 
amplified products the reaction is digested with BspHl (New England Biolabs). The 
reactions are set up and incubated as below: 

BspHl digest 

10(^1 amplified DNA 

lul lOx React 2 (NEB New England Biolabs) 
0.5 jil deionised water 
5 units BstUl 
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37%: 60 minutes 

Following digestion, 2\i\ of loading dye is added to the reaction (lOOmM Tris pH8.0, 
lOOmM Boric Acid, ImM EDTA, 50% (v/v) glycerol, 0.02% w/v Orange G) and the 
mix loaded on a 4% agarose gel (3% NuSieve/1% Seakem, FMC Byproducts) in 0.5x 
TBE (44.5mM Tris pH8.0, 44.5 mM boric acid and 0.5mM EDTA) and 
electrophoresed for 1 hours at 150v. 

Products are visualised by ethidium bromide staining. 

Results 

BspHl digestion each result in bands of 124 and 104bp. The relationship of digestion 
to the possible allele is as shown below: 

Relationship of restriction digest profiles to individual alleles at the E locus 



Allele 


Digestion with BspHl 


ET/Ef 


Yes 


ET 


No 



Samples were prepared from a number of pigs and tested according to the above 
protocol and the results are shown below: 

E genotypes determined for a range of breeds using the BspHl digestion system 



Breed 


No Tested 


Genotype (see note 1) 


Number 


Wild Boarx 


3 


ET/ET 


3 


Swedish 








Landrace 








Large White 


4 


ET/Ef 


4 


Landrace 


1 


Ef/Ef 


1 


Pietrain 


3 


E p /E p 


3 
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Note 1 ♦ Where the genotype E? is listed this cannot be distinguished from in this 
particular test. 

Example 7: Discrimination of cattle products by breed 

DNA was prepared from cattle muscle samples as described in example 4. PCR was 
then carried out in a lOOpi reaction using the primer pair: 



5 ' -TG AGGTAGGAGAGTTTTGGG-3 * and 
5 ' -TCGAAATTGAGGGG AAGACC-3 * 

as described in Kambadur et aL Genome Research 7: 910-915 (1997) at a 
concentration of 500nM with other reaction components being 2.5mM MgCb, 200|iM 
dNTPs, 50mM KC1, lOmM Tris-HCl pH 8.3, 5 units AmpIiTaq Gold (Perkin Elmer). 
1 |il of bovine genomic DNA was used as template. Denaturation was carried out for 
12 min at 94°C followed by 30 cycles of 94°C for 1 min, 55°C for 1 min, 72° 1.5 min 
followed by 5 min at 72°C. Following PCR 2.0^1 of loading dye (44.5mM Tris pH 
8.0, 44.5mM boric acid, 0.5mM EDTA, 50%w/v glycerol, 0.02% w/v Orange G) was 
added to lO^il of product and analysis carried out by electrophoresis on a 2% agarose 
gel prepared in 0.5x TBE buffer (44.5mM Tris pH 8.0, 44.5mM boric acid, 0.5mM 
EDTA) for 1 hour at 100V. 

The remainder of the PCR was analysed for DNA sequencing using ABI dye 
terminator chemistry as described in example 1 . 



Results 

Bovine mvostatin DNA polymorphisms and related phenotvpe 



Breed Phenotype nt position 941 length PCR product (bp) 

Belgian Blue Double muscle G 482 

Piedmontese normal A 493 

Holstein-Friesian Double muscle G 493 
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Example 8 

RT-PCR of porcine KIT exon 16-19 

L mRNA purification from blood samp les 

Fresh blood samples were collected in citrate tubes from coloured Hampshire pigs and 
Large White pigs. Leukocytes were isolated from 5 ml blood using Ficoll 100 
(Pharmacia Biotech). Isolation of mRNA from leukocytes was then carried out using 
the Quickprep Micro mRNA purification kit (Pharmacia Biotech). The mRNA was 
stored as a precipitate under ethanol at -70°C for up to one month before use in 
reverse transcriptase (RT)-PCR. 

ii.RT-PCR of KIT exon 1 6-1 9 

First-strand cDNA synthesis was accomplished using the First-Strand cDNA 
Synthesis kit (Pharmacia Biotech) so that -100 ng mRNA was randomly primed by 
0.1 ng pd(N6) in a total volume of 15 ul. Two ul of the completed first cDNA strand 
reaction was then directly used per 12 ul PCR reaction by adding 1 0 ul PCR mix 
containing 10 pmol each of the mouse/human derived primers KIT1F and KIT7R (5'- 
TCR TAC ATA GAA AGA GAY GTG ACT C and 5'-AGC CTT CCT TGA TCA 
TCT TGT AG, respectively; Moller et al. 1996, supra}, 1.2 pi 10 x PCR-buffer (10 
mM Tris-HCl, pH 8.3, 50 mM KC1) and 0.5 U of AmpIiTaq polymerase (Perlrin- 
Elmer) incubated with an equal amount Taqstart antibody (Clontech) at 25 °C for 5 
min to achieve a hot start PCR. The reaction was covered with 20 pi mineral oil and 
thermocycled in a Hybaid Touchdown machine (Hybaid) with 40 cycles at 94°C for 1 
min, 55-48 °C (touchdown one degree per cycle the first seven cycles and then 48°C 
in the remaining cycles) for 1 min and 72°C for 1 min. After PCR 2ul loading dye 
was added to each sample which were then loaded on 4% agarose gel 
(Nusieve/Seakem 3:1, FMC Byproducts) and electrophoresed with 100V for 80 min. 
Products were visualised by ethidium bromide staining and UV-Ulumination. 

iii. Clonin g and sequencing of RT-PCR-products 

The RT-PCR products representing ATT exon 16-19 were purified by extraction from 
2% agarose gels using the QIAEX gel extraction kit (QIAGEN) and cloned into the 
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pUC 1 8 vector using the Sureclone ligation kit (Pharmacia Biotech). Plasmids were 
isolated using the QIAFilter plasmid Midi kit (Q1AGEN). Cloned plasmid inserts 
were sequenced using dye primer chemistry. Each cycling reaction was prepared with 
plasmid template DNA and ready reaction mix containing fluorescently labelled Ml 3 
forward or reverse primer as described in the ABI Prism protocol P/N 4021 13 (Perkin 
Elmer). Cycling and sample pooling were performed using a Catalyst 800 Molecular 
Biology Workstation (ABI) following the instruments user manual (Document 
number 903877, Perkin Elmer). The resulting extension products were purified, 
loaded and analysed using the 377 ABI Prism sequencer as described by the 
instrument protocol P/N 402078 (Perkin Elmer). 

iy Results and discussion 

A 424 bp fragment including KIT cDNA exon 16-19 was amplified from all pigs. The 
Hampshire pigs did not show any additional products whereas the Large White pigs 
(eight tested) all showed a 301 bp truncated cDNA fragment (Fig 4). Sequence 
analysis revealed the 424 bp fragment was identical in the two breeds whereas the 
whole exon 17 (123 bp) was missing from the 301 bp fragment Apparent differences 
between individuals regarding the relative amounts of these two products may have 
been caused either by different genotypes containing differing numbers of copies of 
the KIT gene sequence, individual differences in mRNA expression levels or random 
RT-PCR effects. 

The two upper fragments present in Large white pigs represent heteroduplexes 
between the 301 and 424 bp fragments (Fig. 2). This was shown by an experiment 
where these slow migrating fragments were generated by pooling homoduplexes of 
the 424 and 301 bp which were then heat denatured and cooled to 25°C. Moreover, 
cloning of the lower heteroduplex fraction of a Large White pig resulted in clones 
with insert length corresponding to either of the two homoduplexes. 
Example 9 



PCR Amplification an d Sequencing of KIT Exon 17-Intron 17 f5' Splice Site^ 
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L PCR to produce DNA Sequencing Template 

A 175 bp region including the boundary between exonl7 and intronl7 of the KIT 
gene was amplified for sequence analysis using forward primer KIT21 (5* - GTA 
TTC ACA GAG ACT TGG CGG C -3') and reverse primer KTT35 (5* - AAA CCT 
GCA AGG AAA ATC CTT CAC GG - 3'). PCR was carried out on a DNA thermal 
cycler (Perkin Elmer 9600) in a total volume of 20 pi containing 25 ng genomic 
DNA, 1.0 mM MgCl 2 , 50 mM KC1, 10 mM Tris-HCl, pH 83, 200 ^iM dNTPs, 0.5 U 
AmpliTaq Gold (Perkin Elmer) and 10 pmol of both KIT21 and KIT35 primer. To 
activate AmpliTaq Gold, initial heat denaturation was carried out at 94°C for 10 
minutes followed by 32 cycles each consisting of 45 sec at 94°C S 45 sec at 55°C and 
45 sec at 72°C. The final extension lasted for 7 min at 72°C. PCR products were 
cloned into vector pUCl 8 using the SureClone ligation kit (Pharmacia Biotech). 

iL Preparation of Plasmid DNA 

Plasrnid DNA was purified from overnight bacterial culture using the Jetstar plasmid 
midi kit (Genomed) and the resulting DNA diluted to 150 ng/^tl. 

iii. Sequencing of plasmid DNA 
DNA was sequenced as in example 8. 

iv. Results 

A portion of the DNA sequence from exon 17 and intron 17 of the KIT gene was 
determined and compared between animals with each of these three alleles. Figure 5 
shows that the / allele carries a splice site mutation at position 1 of intron 17. This G 
to A base substitution is present in one of the two gene copies carried on each 
chromosome. The base substitution occurs in the invariant GT dinucleotide which 
characterises 5' exon/intron boundaries. Analysis of the / allele showed the splice 
site mutation was not present in either the normal (KIT1) or duplicated copy of the 
gene (KIT2). We have found the splice site mutation is unique to the / alleles, and 
therefore makes it possible to distinguish the I-KIT2 sequences. 

Example 10 

not ■« I ^ 



_VVO 98/54360 



PCT/GB98/01531 



43 

Testing For the Presence of the Splice Site Mutation with PCR RFLP 

To easily test for the presence of the G to A splice site mutation, restriction 
endonuclease Main (CATG) was used to exploit the point substitution identified at 
position 1 of intron 17 (Figure 5). The Main recognition sites in the fragment 
amplified from KIT and the expected restriction products are illustrated in Figure 6A 
and 6B respectively. 

I PCR to produce DNA for RFLP Test 

The PCR to produce DNA for RFLP analysis was performed exactly as described in 
example 9. 

ii. Restriction Enzyme Digestion and Electrophoresis 

The PCR amplification product is 175 bp in length. To test for polymorphism at 
position 1 of intron 1 7, digestion reactions were set up as below: 

3.0 ill PCR amplified DNA 
1.0 ^1 10XNEBuffer4 
O.l^lBSAlOO ^g/ml 
0.1 *il Mam 10U/pl 
5.8 ^1 dH20 

(1 X NEBuffer 4 (New England Bioiabs) contains 50 mM potassium acetate, 20 mM 
Tris acetate, 10 mM magnesium acetate and 1 mM DTT). Following incubation at 
37°C for 90 minutes each 1 0 fil reaction volume had 2 |xl of loading dye added and 
the mix loaded on a 8% native polyacrylamide gel (Protogel, 37.5:1 
acrylamide:bisacrylamide , National Diagnostics, Atlanta) in 0.5 X TBE (44.5 mM 
Tris pH 8.0, 44.5 mM boric acid and 0.5 mM EDTA) and electrophoresed for 3 hours 
at 200V in a vertical slab unit (SE600 Hoefer Scientific Instruments). Products were 
visualised by ethidium bromide staining. 



iii Results 
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A PCR RFLP protocol was designed to test for the presence of the splice site mutation 
as the substitution occurs within the recognition site for restriction endonuclease 
Mam. Figure 6B illustrates that presence of the G to A base substitution at position 1 
of KIT intron 17 results in restriction at each of two Malll recognition sites within the 
175 bp DNA fragment. Following electrophoresis, this results in fiagments of sizes 80 
bp, 54 bp and 41 bp. Where the splice site mutation is absent however, incubation 
with Main results in digestion only at recognition site 1. Following electrophoresis 
this results in fragments of 134 bp and 41 bp. The invariant Main recognition site 1 
serves as an internal control to ensure complete digestion has taken place. Results of 
this PCR RFLP analysis are illustrated in Figure 6C. Analysis was performed on 
fragments amplified from clones which either carry the splice site mutation (lane 2) or 
carry the normal splice site sequence (lane 3). Lane 4 shows the result of analysis 
where DNA amplified from the genomic DNA of a coloured animal was used. Lane 5 
shows the resulting bands where a white animal was tested. The test was used to 
analyse 121 individuals from seven different breeds of pig. The splice site mutation 
was found only in the 97 animals with the dominant white phenotype (//- or /*//) and 
none of the 24 coloured (/ or i) examples (see table below). This analysis confirms I 
and /* to be unique in that they are the only alleles to carry the splice site mutation. 

Distribution of the Splice Site Mutation Between Different Breeds and Coat 
Phenotype 



Breed 



Coat 
Colour 



Assumed Animals Normally spliced Splice 
Genotype 1 Tested KIT 2 Mutation 2 



Large White 

Landrace 

Hampshire 

Duroc 

Pietrain 

Meishan 

Wild Boar 

Wild Boar 

x Large 

White 

Totals 



coloured 
coloured 
coloured 
coloured 
coloured 
white 



white 
white 



I/- 
I/- 
i/i 
i/i 
i/i 
i/i 
i/i 
I*/- 



33 
56 
5 
5 
8 
5 



33 

56 

5 

5 

8 

5 

1 

8 



33 
56 
0 
0 
0 
0 
0 
8 
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white 


V- 


89 


89 


89 


white 


I*/- 


8 


8 


8 


coloured 


i/i 


24 


24 


0 



White animals may be homozygous or heterozygous for the / allele 
2 Presence of the splice site mutation determined by NlalU PCR RFLP test 

Example 1 1 

Quantification of Normal KIT and Splice Mutant KIT flntron 17 ntl G ~* A> > 
As the splice site mutation is present in only one of the duplicated regions of /and not 
in the duplicated region of f 9 the various genotypes can be expected to have the 
attributes described in the table below: 



Genotype Copies of Normal Copies of KIT Ratio of normal KIT to 



KIT containing the splice splice mutant KIT 

mutation 

I/I 2 2 1:1 

I/i 2 1 2:1 

i/i 2 0 2:0 

W* 3 1 3:1 

I P /i 3 0 3:0 



Due to the dominance of allele /, three of the genotypes in the table are carried by white 
animals and therefore can not be identified by phenotypic characterisation. 
Quantification of the relative amounts of the normal KIT gene and the splice mutant 
KIT gene allows the ratio between the two to be calculated, and therefore the genotype 
of individual animals predicted. This was achieved by quantification of two DNA 
fragments following NlaUl digestion. The amount of 1 34 bp fragment, representative of 
the normally spliced KIT gene, and of 54 bp fragment, representative of the splice 
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mutant KIT, were measured following electrophoresis using GeneScan software. 
L PCR to Produce DNA for Quantification 

As described in example 9 section i. The reverse primer KIT35 is labelled with the ABI 
fluorescent dye FAM at the 5' end. 

ii Restrictio n FnTvme Digestion 
As described in example 9 section ii. 

m Electrophoresis and Q uantification of DNA Fra prnpntc 

Following digestion, 0.5 ul of the reaction volume was mixed with 2.5 ul of deionised 
formamide, 0.5 pj of GS350 DNA standard (ABI) and 0.4 ul blue dextran solution 
before being heated to 90°C for 2 minutes and rapidly cooled on ice. Three ul of this 
mix was then loaded onto a 377 ABI Prism sequencer and the DNA fragments 
separated on a 6% polyacrylamide gel in 1 X TBE buffer for 2 hours at 700 V, 40 mA, 
32 W. The peak area of fragments representative to both the normal and splice mutant 
forms of KIT were quantitated using the GeneScan (ABI) software. 

iv. Ratio Calculations 

The peak area value of the 134 bp fragment (normal KIT) was divided by twice the 
peak area value of the 54 bp fragment (splice mutant KIT) in order to calculate the ratio 
value for each sample. 

y. Results 

Analysis was performed on animals from the Swedish wild pig/Large White intercross 
pedigree for which genotypes at / have been determined by conventional breeding 
experiments with linked markers. Figure 7 and the table below show the ratio of normal 
to mutant KIT calculated for animals from each of the three genotype classes, VI 
(expected ratio 1 :1), Vi (expected ratio 2:1) and Uf (expected ratio 3:1). The results are 
entirely consistent with the expected ratio values and indicate that the three genotype 
classes can be distinguished using this method. 

Ratio of the Two KIT Forms in Different Dominant White Genotypes in a Wild 
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Pig/Large White Intercross 



Genotype 


Phenotype 


Expected 

Ratio 
(Normal: 
Mutant) 


Observed Ratio 
(Normal.Mutant) 

±SE 


Number 
Tested 


I/I 


white 


1:1 


1.15 ±0.075 


13 


I/I p 


white 


3:1 


3.11 ±0.084 


12 


I/I 


white 


2:1 


2.23 ±0.109 


14 



Figure 7 illustrates that the range of ratio values calculated for the two genotypes I/I 
and I/f do not overlap. This enables animals carrying the f allele to be identified and 
the frequency of the allele within different pig breeds determined. Ratio values were 
calculated for 56 Landrace and 33 Large White animals and the results are shown in 
Figure 8. A clearly bimodal distribution is observed with 7 Landrace and 3 Large 
White individuals having a ratio value of approximately 3 or above, suggesting them 
to be heterozygous carriers for the f allele (genotype I/f\ This means f has gene 
frequency estimates of 6.25% (7/1 12 chromosomes tested) and 4.5% (3/66 
chromosomes tested) within the Landrace and Large White breeds respectively. 

EXAMPLE 12 

(i) DNA Prep aration 

DNA can be prepared from any source of tissue containing cell nuclei, for example 
white blood cells, hair follicles, ear notches and muscle. The procedure outlined here 
relates to blood cell preparations; other tissues can be processed similarly by directly 
suspending material in K buffer and then proceeding from the same stage of the blood 
procedure. The method outlined here produces a cell lysate containing crude DNA 
which is suitable for PCR amplification. However, any method for preparing 
purified, or crude, DNA should be equally effective. 

Blood was collected in 50 mM EDTA pH 8.0 to prevent coagulation. 50 *il of blood 
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was dispersed into a small microcentrifuge tube (0.5 ml Eppendorf or equivalent). 
450 ul of TE buffer was added to lyse the red blood cells (haem groups inhibit PCR) 
and the mix vortexed for 2 seconds. The intact white and residual red blood cells 
were then centrifuged for 12 seconds at 13,000 g in a microcentrifuge. The 
supernatant was removed by gentle aspiration using a low pressure vacuum pump 
system. A further 450 ul of TE buffer was then added to lyse the remaining red blood 
cells and the white blood cells collected by centrifugation as before. If any redness 
remained in the pellet, this process was repeated until the pellet was white. After 
removal of the last drop of supernatant from the pelleted white blood cells, 100 ul of 
K buffer containing proteinase K was added and the mixture incubated at 55°C for 2 
hours. The mixture was then heated to 95-100°C for 8 minutes and the DNA lysates 
stored at -20°C until needed. 

10 mM TRIS-HC1 pH 8.0 
1 mM EDT A 
50 mM KC1 

10mMTRIS-HClpH8.3 
2.5 mM MgCl 2 
0.5% Tween 20 

Prior to use for lysates, 10 ul of 20 mg/ml proteinase K (Molecular Probes Inc.) per 
1 .0 ml of K buffer was added. 

(ii) PCR 

Reactions were set up as follows in thin walled 0.25 ml tubes (Perkin Elmer): 

4.0 ul 5 uM CRC Forward primer; 
4.0 ul 5 uM CRC Reverse primer; 
4.0 ul 5 uM A7T1-REV primer; 
4.0 ul 5 uM KITl-FOR primer; 



Reagents 
TE buffer: 

K buffer. 
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4.0 nl 2 mM dNTPs (Pharmacia); 
4.0 pd 35 mM MgCl 2 . 

A wax bead (PCR Gem 50, Perkin Elmer) was added and the tube placed in a Perkin 
Elmer 9600 thermal cycler. The tube was then raised to 80°C for 15 seconds followed 
by cooling to 4°C. A second set of reagents was then added to each tube as below: - 

4.0 ill lOx buffer; 

9.6 fil sterile deionised water, 

0.4 nl (0.5 units) AmpliTaq DNA polymerase (Perkin Elmer); 
2 DNA lysate. 

Reaction tubes were then placed on a Perkin Elmer 9600 thermal cycler preheated to 
94°C and PCR carried out according to the regime indicated below: - 

94°C for 4 minutes; 

20 cycles of 94°C for 30 sees, 62°C for 30 sees and 72°C for 30 sees; 
0°C until required. 

The number of cycles may vary depending upon the tissue used as the DNA source. 
ATT primers 

Forward GAATATTGTTGCTATGGTGATCTCC £771 -FOR 
Reverse CCGCTTCTGCGTGATCTTCCTG KJTTX -REV 

CRC primers 

Forward CTGGATGTCCTGTGTTCCCTGT CRC-FORWARD 
Reverse AGGTTTGTCTGCAGCAGAAGCTC CRC-RE VERSE 

The reverse ATT primer and the forward CRC primer are labelled with the ABI 
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fluorescent dye FAM at the 5' end. 

(iii) Electrophoresis and Quantitation of DNA Fragments 

1 [il of the PGR was mixed with 2.5 p.1 of deionised fonnamide, 0.5 [il of GS350 
DNA standards, 0.4 jxl blue dextran solution, heated at 90°C for 2 minutes followed 
by rapid cooling on ice. 3 jxl of this mix were then loaded onto an AB 1373 DNA 
sequencer and DNA fragments separated on a 6% polyaciylamide gel in 1 x TBE 
buffer for 2 hours at 700 V, 40 mA, 32 W. The fragments corresponding to the 
products from the ATT and CRC genes were quantitated using GeneScan software, the 
peak area for each of the bands being determined. 

(iv) Results 

The data given in the table below represents the results obtained from an experiment 
in which DNA lysates were produced from each of 23 animals, with two PCR tests 
being carried out on each lysate. The ratio of ATT peak area to CRC peak area was 
calculated for each PCR and the average taken of those samples from the same 
animal. 



Animal 


Genotype 


KTT/CRC peak 
area ratio 


I 


n 


3.25 


2 


Ii 


2.45 


3 


ii 


2.94 


4 


ii 


1.16 


5 


ii 


1.34 


6 


ii 


1.20 


7 


Ii 


2.18 




Ii 


2.19 


9 


II 


2.88 


10 


ii 


1.30 


11 


Ii 


1.84 | 


1 12 


II 


2.84 


1 13 


ii 


1.50 1 
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14 


ii 


1.30 | 


15 


Ii 


2.07 


16 


ii 


1.31 


17 


ii 


1.14 


18 


Ii 


2.02 


19 


Ii 


1.87 


20 


Ii 


2.00 J 


21 


ii 


0.99 


22 


ii 


1.15 


23 


n 


2.80 J 



The upper and lower limits for the ratio values from animals of the different 
genotypes II, Ii and ii in this experiment are as below: 

Genotype Upper Limit Lower Limit 

/// 3.25 2.80 

Hi 2.45 1.84 
Hi L50 0.99 

These results illustrate differentiation of the genotypes using this test 



EXAMPLE 13 

The second test utilises unique sequences of DNA that are present at one end of the 
duplication (or both ends if the duplicated region is reversed relative to the rest of 
the gene or if the duplicated region does not occur in direct tandem with the non- 
duplicated region). Oligonucleotide primers for use in PGR are designed such that 
at the annealing temperatures used in the PCR process, they will anneal only to the 
junction regions at the end of the duplicated region. A PCR is then carried out 
using two pairs of oligonucleotides. One pair consists of the aforementioned 
primer spanning the junction region and a second primer a suitable distance away 
which allows amplification to occur only from / allele containing duplication. The 
second pair of primers allow amplification of a sequence present only as a single 
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copy in the haploid genome. The product of this reaction, carried out in the same 
tube, functions as an internal standard as in the previous test. The ratio of product 
from the reaction specific to the junction region is measured relative to that from 
the single copy control sequence. 

In this test there is a larger difference between the predicted ratios of the products 
from the different genotypes. The relative levels of product and their ratios are 
illustrated below: - 

Junction Control 
Genotype Product Product Ratio 

II 2 2 1:1 

K 1 2 1:2 

ii 0 2 0:2 

These larger ratios allow greater differentiation between the ranges of results 
obtained from the different genotypes, reducing risks of miss-scoring animals. 

EXAMPLE 14 

(i) DNA Preparation 

DNA can be prepared as described in example 12. 

(ii) PCR 

Reactions were set up as follows in thin walled 0.25 ml tubes (Perkin Elmer): 

2.0 ul 5 mM A77DEL2-FOR primer; 
2.0 ul 5 mM A77DEL2-REV primer; 
1 .0 ul 2 mM dNTPs (Pharmacia); 
1.2 uI25mMMgC12 
2.0 ul lOx buffer (without MgC12) 

0.1 ul (0.5 units) AmpliTaq DNA polymerase (Perkin Elmer); 

2.0 ul DNA lysate; 

9.7 ul sterile deionised water. 
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Reaction tubes were then placed on a Perkin Elmer 9600 thermal cycler and PCR 
carried out according to the regime indicated below:- 

95°C for 1 minute; 

3 cycles of 95°C for 15 sees, 50°C for 20 sees and 72°C for 40 sees; 
27 cycles of 94°C for 15 sees, 50°C for 20 sees and 72°C for 50 sees; 
72°C for 5 minutes; 
4°C until required. 

The number of cycles may vary depending upon the tissue used as the DNA 
source. 

ATT primers 

Forward GAAAGTGA(C/T)GTCTGGTCCTAT(C/G)GGAT A7TDEL2- 

FOR 

Reverse AGCCTTCCTTGATCATCTTGTAG A77DEL2-REV 

(iii) Electrophoresis 

1 of the PCR product was mixed with 3 |xl loading buffer (95% deionised 
fonnamide, lOmM NaOH, 20mM EDTA, 0.05% bromophenolblue, 0.05% 
Xylene-cyanol), heated to 95°C for 3 minutes followed by rapid cooling on ice. 
The sample was then loaded on an 8% native polyacrylamide gel (Protogel, 37.5:1 
Acrylamiderbisacrylamide, National Diagnostics, Atlanta) in 1 x TBE buffer 
(89mM Tris, 89mM boric acid, 2mM EDTA.Na2). The DNA fragments were 
separated by electrophoresis for 4.5 hours at 6W with a constant temperature of 
20°C and 0.6 x TBE as running buffer in a vertical slab unit (SE600 Hoefer 
Scientific Instruments, San Francisco). 

(iv) Visualisation of DNA fragments by silver staining 

After electrophoresis the gel was incubated, with gentle agitation, in the fix 
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solution for 20 minutes or until the tracking dyes were no longer visible. The gel 
was rinsed three times (2 minutes each with agitation) in deionised water. The gel 
was then incubated in the staining solution for 40 minutes, with gentle agitation, 
followed by a brief wash (5-10 seconds) in deionised water and direct transfer to 
the developing solution. The gel was incubated in the developing solution until 
bands were clearly visible and then the development was terminated by adding an 
equal volume of fix solution. Finally, the gel was rinsed for 2 minutes in 
deionised water. 

Reagents 

Fix solution: 10% glacial acetic acid in deionised 

water 

Staining solution: 2 g silver nitrate (AgN03) 

3 ml 37% formaldehyde 
2 litres deionised water 

Developing solution: 60 g sodium carbonate (Na2C03) dissolved in 2 liters 

deionised water. Immediately before use add 3 ml 37% 
formaldehyde and 400 ml sodium thiosulfate (10 mg/ml). 
The solution should be at a temperature of 10-12°C when 
used. 

(v) Results 

This SSCP analysis reveals an informative polymorphism so far only found in 
animals with the dominant white phenotype (Fig. 9). In lanes 1 to 8 the analysis 
was carried out on DNA from Swedish Landrace pigs carrying the dominant white 
colour and in lanes 9 and 10 DNA was from wild pigs of wild type colour. The 
polymorphic bands are indicated. The polymorphism is characterised by two 
unique fragments only present in animals carrying a duplicated KIT gene of allele 
type /. The fragments represent heteroduplexes of DNA strands from PCR products 
of unequal length representing the duplicated and non-duplicated copy of the KIT 
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gene. The results of a screening test with this marker using 40 unrelated animals 
representing five breeds and 190 F2 animals from a Large White/Wild pig 
intercross are presented in the table below: 



DDCCn 

DKeeLI 


LULUUK 


WU. Ur 
ANIMALS 


HETERODUPLEX J 








PRESENT 


NOT B 
PRESENT 


SWEDISH 
LANDRACE 


WHITE 


10 


10 


0 


SWEDISH LARGE 
WHITE 


WHITE 


8 


8 


0 


SWEDISH 
HAMPSHIRE 


COLOURE 
D 


10 


0 


10 


SWEDISH 
DUROC 


COLOURE 
D 


10 


0 


10 


WILD PIG 


COLOURE 
D 


2 


0 


2 


LARGE WHITE/ 
WILD PIG 
INTERCROSS 


WHITE 
PATCH 
COLOURE 
D 


131 

9 
50 


106 

0 

0 


25 

9 

50 



The results show that this particular polymorphism is very closely associated with 
the presence of the ATT duplication. It is not completely associated with the 
duplication as some white animals did not show the heteroduplex pattern. The 
polymorphism is therefore an example of a closely linked genetic marker which by 
itself or in combination with other linked markers can be used to differentiate 
genotypes as regards the dominant white coat colour. 

EXAMPLE 15 

i) DNA extraction 
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DN A was prepared as in example 12. 
ii)PCR 

Reactions were set up in 0.25ml thin walled reaction tubes (Perkin Elmer) as 
follows: 

0.5ul 5 uM A77DEL 1 -FOR primer 

0.5ul 5 A77DEL1-REV primer 

1 .Oul 2mM dNTPs (Pharmacia) 

l.Oul 15mMMgCl 2 

l.Oul 1 OX buffer 

4.9ul Sterile distilled water 

0. 1 ul AmpliTaq DNA polymerase 

l.Oul DNA lysate 

Reaction tubes were then placed in a Perkin Elmer 9600 thermal cycler and PCR 
carried out according to the regime 

94°C for 4 minutes; 

21 cycles of 94°C for 30 sec, 60°C for 30 sec, and 72°C for 30 sec 
72°C for 4 min; 
4°C until required. 

The number of cycles used may vary depending on the tissue used as the source of 
the DNA. 

Primers 

forward TGTGGGAGCTCTTCTCTTTAGG A77DEL1-FOR 

reverse CC AGC AGGAC AATGGG AACATCT KITDEL 1 -REV 

The reverse primer was labelled with the ABI fluorescent dye FAM at the 5' end. 
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iii) Electrophoresis and quantitation of DNA fragments 

1 |xl of the PGR was mixed with 1 .5jil of deionised formamide, 0.25|il of GS350 
DNA standards, 0.25fii loading buffer (50mg/ml blue dextran, 25mM EDTA) and 
heated at 90°C for two minutes followed by rapid cooling on ice. 1 .75^1 of this was 
then loaded onto an ABI 377DNA sequencer and DNA fragments separated on a 
4.12% polyacrylamide gel in Ix TOE buffer for two hours at 3000V, 60mA, 
200W and 48°C. The 97bp and 93bp fragments corresponding to the products from 
the ATT gene template lacking the deletion and containing the deletion respectively 
were quantitated using GeneScan software, the peak area for each of the bands 
being determined. 

Results 

The data given in the table below represents the results obtained from an 
experiment in which DNA lysates were produced from each of 20 animals of 
known genotype with one PCR test being carried out on each lysate. The ratio of 
the peak area of the product from the DNA template not containing the four base 
pair deletion to that containing the deletion was calculated. 



ANIMAL 


GENOTYPE 


Non del/del 
peak area ratio 


1 


11 


1.347 


2 


II 


1.21 


3 


II 


1.33 


4 


II 


2.267 


5 


II 


0.444 


6 


II 


0.713 


7 


II 


8.387 


8 


II 


0.994 


9 


II 


1.673 


10 


II 


1.056 J 


11 


Ii 


1.751 1 


12 


Ii 


1.73 1 
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I 13 


Ii 


1.83 


1 14 


Ii 


0.631 


I 15 


Ii 


1.975 


I 16 


Ii 


2.147 


17 


Ii 


1.901 


18 


Ii 


1.749 


19 


Ii \ 


2.103 


20 1 


Ii 


2.026 



For this small sample the value of 1 .5 which is midway between the predicted ratio 
values for each genotype (expected ratio=2 for Ii and 1 for II) might be used as the 
dividing line for scoring the animals to either genotype. It can be determined from 
the table that 7/10 Hand 9/10 Ii are identified as the correct genotype. 



Example 16 

Sequencing of KIT cDNA clones 

mRNA was isolated from peripheral blood leukocytes from white (Landrace/Large 
White) and coloured (Hampshire) pigs using the Message Maker mRNA isolation 
system (Gibco BRL) with one mRNA selection from total RNA. lOOng poly(A) + 
mRNA was reverse-transcribed with random primers (First-Strand cDNA Synthesis 
kit, Pharmacia Biotech) and the product was used at a 1:10 dilution for RT-PCR 
using the proof-reading Advantage KlenTaq Polymerase (Clontech) according to the 
manufacturer's recommendation. The following primers were used to amplify almost 
the entire coding sequence and some of the 5' untranslated region: KIT40 (5'-GGC 
TCT GGG GGC TCG GCT TTG C) corresponding to the 5'untranslated region and 
KIT22S (5>- TCA GAC ATC TTC GTG GAC AAG CAG AGG) corresponding to 
exon 21; both primers had been designed using consensus sequence of the human 
and mouse KIT sequences in the GENBANK database. The RT-PCR products were 
gel purified and cloned using the pGEM-T vector system (Promega). Plasmid clones 
were sequenced using a set of internal primers and the ABI Prism™ dRhodamine 
Terminator Cycle Sequencing Kit (PE Applied Biosystems). Two subclones 
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representing each type of KIT sequence were initially sequenced and in those -cases 
where a discrepancy was observed (possibly due to PCR errors) additional clones 
were sequenced over those particular nucleotide sites. RT-PCR analysis of KIT exon 
16-19 was carried out with the primers KIT1F (5'-TCR TAC ATA GAA AGA GAY 
GTG ACT C) and KIT7R (5'-AGC CTT CCT TGA TCA TCT TGT AG). 

Results 

The sequence of the KIT gene coding region derived from an animal of the 
Hampshire Breed is shown in Figure 10. Differences between KIT cDNA 
sequences cloned from a Hampshire and a Yorkshire/Landrace pig, respectively 
are shown in the table below. The sequence comparison includes the whole open 
reading frame 2919 bp, except for the last 27 bp occupied by the reverse PCR 
primer. Exon and base pair position number as well as amino acid codon are given 
for each difference. Polymorphic bases are shown in bold. A dash indicates identity 
with the Hampshire (j) allele. 
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Example 17 
DNA preparation 

Genomic DNA was prepared as described in example 12. 
PCR 

A 158bp fragemnt covering 99bp of the end of exon 19 and 59bp of the KIT gene was 
amplified using forward primer LA93 (5'-GAG CAG CCC CTA CCC CGG AAT GCC 
AGT TGA-3') and reverse primer KIT56 (5'-CTT TAA AAC AGA ACA TAA AAG 
CGG AAA CAT CAT GCG AAG G-3'). PCR was carried out on a Perkin Elmer 9600 
Thermal Cycler in a total volume of 20^x1 containing 25ng genomic DNA, 1.5mM 
MgCI 2 , 50mM Kcl, lOmM Tris-HCl, pH 8.3, 200nM dNTPs, 0.5u AmpliTaq Gold 
(Perkin Elmer) and 10 pmol of both LA93 and KIT56 primer. To activate AmpliTaq 
Gold, initial heat denaturation was carried out at 94°C for 10 minutes followed by 32 
cycles each consisting of 45 sec at 94°C, 45 sec at 55°C and 45 sec at 72°C. 

Restriction digestion and electrophoresis 

The PCR amplification product is 158 bp in length. To test for polymorphism at 
position 93 of this product (corresponding to position 2678 of the KIT cDNA sequence) 
digestion reactions were set up and incubated as below: 

6.0^1 PCR product 

1 .Ojxl 1 Ox reaction buffer 3 (New England Biolabs) 
02\xl Acil(5u/\il) 
2.8yl deionised water 

Following digestion at 37°C for 120 minutes each 10^1 reaction volume had 2^1 of 
loading dye aded and the mix was loaded on an 8% native polyacrylamide gel 
(Protogel, 37.5:1 acrylamide:bisacrylarnide, National Diagnostics, Atlanta) in 0.5 x 
TBE (44.5 mM Tris pH8.0, 44.5 mM boric acid and 0.5 raM EDTA) and 
electrophoresed for 3 hours at 200v in a vertical slab gel unit (SE600 Hoefer Scientific 
Instruments). Products are visualised by ethidium bromide staining. 
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Results 

The reverse primer is designed such that an Acil site is introduced into the amplified 
sequence. This results in digestion of amplicon with Acil releasing a fragment of 23bp 
that allows confirmation of the digestion process. Digestion of the remaining 135 bp 
fragment into fragments of 92 and 43 bp is dependant on the nucleotide at the position 
corresponding to position 2678 of the KIT cDNA sequence. T at this position prevents 
digestion while a C at this position allows digestion. Gel resolution is not sufficient to 
allow resolution of the 23bp fragment but comparison to undigested product allows 
confirmation of the process. 

Figure 1 1 illustrates the results obtained with animals of a range of genotypes. 

The test was used to analyse a total of 66 unrelated individuals from seven breeds of 
pig. The results are shown in the table below: 



Breed 


No. 


KIT Genotype 1 


Genotype at pos'n 2678 










C/C 


or 


T/T 


Hampshire 


4 


i/i 


1 


l 


2 


Polish Wild Boar 


13 


i/i 


0 


i 


12 


Duroc 


11 


i/i 


0 


l 


10 


Pietrain 


1 


i/i 


0 


0 


1 


Swedish Wild Boar 


1 


i/i 


1 


0 


0 


Swedish Landrace 


12 


I/I 


0 


0 


12 




5 


Uf 


0 


2 


3 


Swedish Yorkshire 


14 


VI 


0 


1 


13 




5 


I/f 


0 


1 


4 



I Genotype based on Malll RFLP analysis as described in example 1 1. 



Example 1 8 
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Determination of genotype at the / locus using a rapid DNA based test 
Crude DNA lysates were prepared from hair samples from animals of three breeding 
lines, a Hampshire based line, a Large White line, and white animals from a cross bred 
line originally produced from the two former lines. Four hair follicles were placed into 
100^1 of K buffer (50mM KC1, lOmM Tris-HCl pH 8.3, 2.5mM MgCl 2 , 0.5% w/v 
Tween 20) and lfjl Proteinase K (15 mg/ml) (Boehringer) added. This mix was 
incubated for 2 hours at 55°C followed by 16 min at 95°C. DNA was also prepared as 
described in example 12. 

Allelic discrimination reactions were set up using the PE Applied Biosystems 
TaqMan™ system. 25jil reactions contained the primers E19FOR (5- 
GAGCAGCCCCTACCCCGGAATGCC AGTTGA-3 ') and E19REV (5- 
CTTTAAAAC AGAAC ATAAAAGCGGAAAC ATC ATGCGAAGG-3 ') at 300nM, 
8% glycerol (w/v) IX TaqMan™ buffer A (PE Applied Biosystems), 5mM MgCl 2 , 
200^iM dATP, dGTP, dCTP and dUTP, 0.65 units AmpHTaq Gold™ (PE Applied 
Biosystems), 0.25 units AmpErase™ UNG (PE Applied Biosystems) and the 
TaqMan™ probes E19PC (5'- CATACATTTCCGCAGGTGCATGC-FAM) and 
E19PT (5'- TCATACATTTCCACAGGTGCATGC-TET) at a concentration of 
lOOmM. lfil of crude lysate DNA was used as template. PCR amplification was 
carried out using a PE9600 thermal cycler (PE Applied Biosystems) or a the ABI7700 
Prism (PE Applied Biosystems) with a thermal cycling regime of 50°C for 2 min 
followed by 95°C for 10 min followed by 40 cycles of 95°C 15 sec, 62°C 1 min. 8 
control samples of each homozygote genotype, 2678C and 2678T, and 8 no template 
controls where deionized water was substituted for template controls were used per 96 
well plate. Allele identification based on these reactions was carried out using the 
allelic discrimination function of the ABI7700 Prism (PE Applied Biosystems). 

Results 

The test was used to analyse a total of 20 unrelated individuals from four breeds of pig. 
The results are shown in the table below: 



Breed 



No. Assumed KIT Genotype at pos'n 2678 
Genotype 
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C/C 



C/T 



T/T 



Hampshire 
Landrace 
Duroc 
Pietrain 



5 
5 
5 
1 



i/i 
VI 
i/i 
i/i 



1 
0 
0 
0 



1 

0 
0 
0 



3 
5 
5 
5 



Example 19 

Complete cosegregation of the Belt coat colour locus and KIT 
Method 

Hampshire pigs have a characteristic coat colour phenotype with a white belt on a 
solid black background. Belt is determined by a dominant allele {Be). The segregation 
of the Belt locus was investigated in a backcross between Hampshire (Be/Be) and 
Pietrain (be/be) pigs. Fl sows (Be/be) were back-crossed to pure-bred Pietrain (be/be) 
boars. DNA preparations were carried out exactly as described in Example 3. 

KIT exon 1 9 PCR RFLP 

i) PCR to produce DNA for the RFLP test 

A 158 bp fragment covering 99 bp of the 3' end of exon 19 and 59 bp of intron 19 of 

the KIT gene was amplified using the following primers: 

forward LA93 5' - GAGCAGCCCCTACCCCGGAATGCCAGTTGA -3' 

and reverse 

KIT56(5' -CTTTAAAACAGAACATAAAAGCGGAAACATCATGCGAAGG -3'). 
PCR was carried out in a total volume of 20 pi containg 25 ng genomic DNA, 1.5 
mM MgCl 2 , 50 mM KC1, 10 mM tris-HCl, pH 8.3, 200 uM dNTPs, 0.5 U AmpliTaq 
Gold (Perkin Elmer) and 10 pmol of both LA93 and KIT56 primer. To activate 
Amplitaq Gold, initial heat denaturation was carried out at 94°C for 10 minutes 
followed by 32 cycles each consisting of 45 sec at 94°C, 45 sec at 55°C and 45 sec at 
72°C. 
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ii) Restriction enzyme digestion and electrophoresis 

The PCR amplification product is 158 bp in length. To test for polymorphism at 
position 93 of this product, digestion reactions were set up and incubated as follows: 

6.0 ^il PCR amplified DNA 
1.0 nl 10XNEBuffer3 
0.2 ^c/I(5U/^l) 
2.8 ill dH20 

(1 X NEBuffer (New England Biolabs) contains 100 mM sodium chloride, 50 mM 
Tris-HCl, 10 mM magnesium chloride, and 1 mM DTT). Following digestion at 37°C 
for 120 minutes, two jxl loading dye was added to each sample and the mix loaded on 
a 12 % native polyacrylamide gel in 0.5 % TBE (44.5 mM Tris pH 8.0, 44.5 boric 
acid and 0.5 mM EDTA) and electrophoresed for 3 hours at 200 V in a vertical slab 
unit. Products were visualised by Ethidium bromide staining. 

Results 

KIT nucleotide 2678 is polymorphic and a C or T occurs at this position. The 
presence of a C creates a restriction site for Act I which is absent when a T is present. 
A second Acil site has been engineered into the reverse primer KIT56 to serve as an 
internal control of digestion and is therefore invariant The polymorphism can be 
detected by a simple PCR-RFLP analysis as described in the table below. 

Detection of KIT single nucleotide polymorphism (SNP) at position 2678 

Size in bp of DNA 
Nucleotide fragments after digestion 

C 23+43+92 
T 23 + 135 

The cosegregation of the Belt and KIT loci in this pedigree is summarised in the table 
below. 
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Cosegregation between KIT and Belt in a Hampshire/Pietrain backcross 



Animal No tested Phenotvpe Belt locus KIT SNP2678 



Fl sows 


14 


Belt 


Be/be 


C/T 


Pietrain sires 


2 


non-Belt 


be/be 


17T 


Offspring 


41 


Belt 


Be/be 


C/T 


Offspring 


41 


non-Belt 


be/be 


ITT 



The complete cosegregation between the Belt phenotype and the KIT polymorphism 
shows that this phenotype most likely is controlled by a mutation at the KIT locus. 
This means that detection of KIT polymorphism can be used to identify animal 
products derived from Hampshire pigs since the Belt is the most important breed 
determinant in Hampshire pigs. It is likely that the Belt phenotype present in 
Saddleback and Hannover-Braunschweig pigs is controlled by the same locus. 

Example 20 

Determination of the sequence of the 5' untranslated and 5* coding region of the 
aMSHR gene 

The entire coding region of the aMSHR gene was determined and compared between 
pig breeds known to carry the different at the E locus, and £F. Hampshire 

carries and has a solid black body interrupted with a white belt This belt is the 
result of another coat colour locus. The Wild Boar which carries allele E? has a 
wildtype phenotype while the Pietrain breed carries allele EF and is characterized by 
having black spots on a white body. 

PCR to produce DNA for clone construction 

The entire coding region of the aMSHR gene was amplified from genomic DNA 
using primers EPIG10 and EPIG16. These primers have sequence: 
EPIG10 5' - GGT CTA GAT CAC CAG GAG CAC TGC AGC ACC - 3' 

EPIG1 6 5' - GGG AAG CTT GAC CCC CGA GAG CGA CGC GCC - 3' 

PCR was carried out on a DNA thermal cycler (Perkin Elmer 9600) in a total volume 
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of 20 ^1 containing 25 ng genomic DNA, 1.5 mM MgCl 2 , 50 mM KC1, 10 mM Tris- 
HC1, pH 8.3, 200 i*M dNTPs, 5.0 % DMSO (dimethyl sulfoxide), 0.5 U AmpliTaq 
Gold (Perkin Elmer) and 10 pmol of both EPIG10 and EPIG16. To activate AmpliTaq 
Gold, initial heat denaturation was carried out at 96°C for 10 minutes followed by 32 
cycles each consisting of 45 sec at 94°C, 45 sec at 55°C and 45 sec at 72°C. The final 
extension lasted for 7 min at 72°C. 

Cloning of PCR products 

To facilitate cloning of PCR products, both primers were designed with restriction 
endonuclease recognition sites located at the 5' end. Primer EPIG10 has sequence 
TCTAGA which is cut using enzyme Xbal and EPIG16 contains sequence AAGCTT 
which is cut using enzyme HindUI. Following PCR as described above, the entire 
reaction volume was electrophoresed and purified using the Qiaex II gel extraction kit 
following the manufacturers instructions (Qiagen). The purified PCR product was 



digested prior to ligation as follows: 

PCR product 17.0^1 

5.0ui//>idIII (Amersham) 1.0 ^1 

5 .0u Xbal (Amersham) 1.0 p.1 

x 1 0 reaction buffer M (Amersham) 3 .0 ^il 
xlO bovine serum albumin (Amersham) 3.0 pi 

H 2 Q 5.0 Kil 



The reactions were incubated at 37 degrees C for 16 hours before the digested DNA 
was purified by passage through a QIAquick spin column following the 
manufacturers instructions (Qiagen). PCR products were ligated into 100 ng of vector 
pRc/CMV (Invitrogen) using 400 U T4 DNA ligase (New England Biolabs) in a total 
reaction volume of 20 \x\ containing 10 mM MgCl 2 , 50 mM Tris-HCl, pH 7.5, 10 mM 
dithiothreitol, 1 mM ATP and 25 \ig/m\ bovine serum albumin. Ligation reactions 
proceeded at 16 degrees for 16 hours. 

Preparation and Sequencing of Plasmid DNA 

Plasmid DNA was purified from overnight bacterial culture using the Jetstar plasmid 
midi kit 50 (Genomed) and the resulting DNA diluted to 15 ng/^1. Dye terminator 
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cycle sequencing was performed using AmpliTaq DNA polymerase in accordance 
with the ABI Prism protocol P/N 402078 (perkin Elmer). Cycle sequencing reactions 
comprised 1.6 pmoie of either T7 or SP6 sequencing primer (Promega), 15 ng of 
plasmid DNA and the terminator ready reaction mix (Perkin Elmer). The cycle 
sequencing reactions were performed in a GeneAmp 9600 machine over 25 cycles, 
each consisting of 10 sec at 96°C, 5 sec at 50°C and 4 min at 60°C. Extension 
products were purified for gel separation using ethanol precipitation, loaded and run 
on a 377 ABI Prism DNA sequencer as described by the instrument protocol (Perkin 
Elmer protocol P/N 402178). 

Results 

A 2bp insertion was identified in the aMSHR gene of pigs of the Pietrain breed which 
carry the EF allele between nucleotide positions equivalent to 66 and 67 in the Wild 
Boar aMSHR sequence. This results in a shift in the translation frame and creates a 
TGA stop codon at nucleotide positions equivalent to 161 to 163 in the Wild Boar 
aMSHR sequence. The 5 5 portion of the aMSHR coding sequence compared 
between three breeds is shown below. This comparison illustrates the two base pair 
insertion present within the alleles carried by the Pietrain animal when compared with 
either the Hampshire or Wild Boar alleles.. The ATG start codon is highlighted in 
bold, the 3' end of primer EPIG16 is shown in italics and bases in common with the 
Pietrain sequence are marked with a dash. Missing bases are marked with :. 

Pietrain CGACGCGCCC TCCCTGCTCC CTGGCGGGAC GATGCCTGTG CTTGGCCCGG 

Meishan 



Wild Boar 

Pietrain 
Meishan 
Wild Boar 

Pietrain 
Meishan 
Wild Boar 





AGAGGAGGCT 


GCTGGCTTCC 


CTCAGCTCCG 


CGCCCCCAGC 


CGCCCCCCCC 




GCCTCGGGCT 


GGCCGCCAAC 


CAGACCAACC 


AGACGGGCCC 


CCAGTGCCTG 



Pietrain GAGGTGTCCA TT 

Meishan — 

Wild Boar — 

These results are also incorporated into figure la. 



Example 21 
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A rapid DNA test for the presence of the 2bp insertion mutation in the porcine 
aMSHR gene allowing rapid distinction of the E? allele from all other alleles 
identified at this locus 

PCR was conducted with forward primer EPIG16 (see above) and reverse primer 
MC1R121A exactly as described above. The reverse primer was labeled with ABI 
dye Hex and has sequence: 

MC1R121A 5' - Hex- GGA CTC CAT GGA GCC GCA GAT GAG CAC GGT - 

3' 

Following PCR cycling, 0.2 p.1 of the reaction volume was mixed with 2.5 p.1 of 
deionised formamide, 0.5 |xl of GS500 DNA standard (ABI) and 0.4 fxl blue dextran 
solution before being heated to 90°C for 2 minutes and rapidly cooled on ice. 1 \il of 
this mix was then loaded onto a 377 ABI Prism sequencer and the DNA fragments 
separated on a 6% polyacrylamide gel in 1 X TBE buffer for 2 hours at 700 V, 40 
mA, 32 W. The length of the resulting PCR products were determined using the 
GeneScan software (ABI). 

Results 

A test was devised to assay genomic DNA directly for the presence of the identified 2 
bp insertion. Primers EPIG16 and MC1R121A were used to PCR amplify 448 bp of 
the 5' portion of the wildtype porcine MC1R gene. To facilitate fluorescent detection 
of amplified products, the ABI dye HEX was covalently attached to the 5' end of 
primer MC1R121A. PCR was conducted on a number of unrelated individuals from 
three breeds and the resulting PCR products size determined using the GeneScan 
software (ABI). The results are presented in the table below: 



Breed Number Extension Size of PCR product 

tested Genotype 

448 bp 450 bp 



Pietrain 5 EPIEF 0 5 

Large White 3 ff IE? 0 3 

Hampshire 5 5 0 
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Analysis of the length of PCR products amplified between individuals showed either a 
448 bp or 450 bp product resulted. The expected 448 bp fragment was amplified from 
each H a mpshir e animal, however a product 2 bp longer was detected from each 
Pietrain and Large White animal tested. This indicates the 2 bp insertion identified via 
sequence analysis to be present in the genomic DNA of Pietrain and Large White, two 
breeds ascribed the E? allele, but not in Hampshire which carries ET. 

EXAMPLE 22 

In addition to die coding region of the aMSHR gene, DNA sequence polymorphism 
may exist between breeds within the untranslated regions (UTR). Sequence 
information was collected from the 3' UTR and compared between six breeds of pigs 
which display a variety of coat color phenotypes. 

PCR to produce DNA for sequencing 

A 454 bp product containing 38 coding nucleotides from the 3* portion of the 
molecule and 416 bp of 3* untranslated region {not including primer binding sites) 
was amplified using primers EPIG13 and EPIG14. These primers have sequence: 
EPIG1 3 5* - GCA AGA CCC TCC AGG AGG TG - 3* 
EPIG14 5* - CAC TGA GCC GTA GAAGAG AG- 3' 

PCR was carried out on a DNA thermal cycler (Peririn Elmer 9600) in a total volume 
of 20 yl containing 25 ng genomic DNA, 1.5 rnM MgCl* 50 mM KC1, 10 mM Tris- 
HC1, pH 8.3, 200 dNTPs, 0.5 U AmpliTaq Gold (Perkin Elmer) and 10 pmol of 
both EPIG13 and EPIG14. To activate AmpliTaq Gold, initial heat denaturation was 
carried out at 96°C for 10 minutes followed by 32 cycles each consisting of 45 sec at 
94°C, 45 sec at 55°C and 45 sec at 72°C. The final extension lasted for 7 min at 72°C. 

Sequencing of PCR products 

PCR products were sequenced using dye terminator chemisty. This first requires 
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purification of PCR product free from excess dNTPs and primers. This was achieved 
by passage of the template DNA through a QIAquick spin column following the 
manufacturers instructions (Qiagen). Cycle sequencing reactions were performed on 
20 ng of purified template using either EPIG13 or EPIG14 as described in Example 1. 

Results 

Primers EPIG13 and EPIG14 were used to amplify a 454 bp region of DNA which 
comprises both the 3' terminal coding region of the MC1R gene and the immediately 
adjacent 3 s UTR. The sequence information collected is displayed in figure 15 and 
two polymorphic positions were identified. 

The first polymorphism identified is a 1 bp deletion common to Meishan and Large 
Black which occurs seven positions downstream from the stop codon at the equivalent 
of nucleotide position 1007 in the Wild Boar sequence. As this deletion occurs outside 
the translated region it is not expected to alter the amino acid composition of the 
resulting receptor molecule, however its influence on mRNA stability and 3' end 
formation through endonucleolytic cleavage is unknown. It is unique to two breeds 
which carry ET and which carry a variant of the aMSHR gene not found in any other 
breed examined to date. 

The second polymorphism is a base substitution at nucleotide position 1 162 unique to 
the European Wild boar. Figure 15 shows five breeds to have a G base at this position 
and the Wild boar to contain an A. This sequence difference offers the possibility to 
distinguish the European Wild boar from the other breeds analysed with a DNA based 
test. 
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1. A method for 

(a) differentiating animals and animal products on the basis of breed origin; or 

(b) determining or testing the breed origin of an animal product; or 

(c) validating an animal product; 
comprising the steps of: 

(i) providing a sample of the animal product; and 

(ii) analysing die allele(s) of one or more breed determinant genes present in 
the sample. 

2. The method of claim 1 wherein the breed determinant is a monogenic trait 

3. The method of claim 1 wherein the breed determinant is a polygenic trait 

4. The method of any one of claims 1-3 wherein the overt phenotypic trait is a 
behavioural or morphological trait 

5. The method of claim 3 or claim 4 wherein the overt phenotypic trait varies 
qualitatively or quantitatively between breeds. 

6. The method of any one of the preceding claims wherein the breed determinant 
gene analysed in step (ii) is selected from any of: 

(a) a coat colour gene; and/or 

(b) a coat pattern gene; and/or 

(c) a coat texture gene; and/or 

(d) a coat density gene; and/or 

(e) a coat length gene; and/or 

(f) an ear aspect gene; and/or 

(g) a double muscling gene; and/or 

(h) a horn morphology gene; and/or 

(i) a tusk morphology gene; and/or 
(j) an eye colour gene; and/or 
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(k) a plumage gene; and/or 
(1) a beak colour/morphology gene; and/or 
(m) a vocalization (e.g. barking) gene; and/or 
(n) a comb or wattle gene; and/or 

(0) a gene controlling display behaviour. 

7. The method of claim 6(a) wherein the coat colour gene is the KIT or aMSHR gene 
(for example, the pig KIT or aMSHR gene). 

8. The method of any one of the preceding claims wherein the sample is a nucleic 
acid sample and the analysing step (ii) comprises DNA or RNA analysis. 

9. The method of any one of claims 1-7 wherein the sample is a protein sample and 
the analysing step (ii) comprises protein analysis. 

10. A method of determining the coat colour genotype of a pig which comprises: 

(1) obtaining a sample of pig nucleic acid; and 

(ii) analysing the nucleic acid obtained in (i) to determine which allele or 
alleles of the aMSHR gene is/are present 

11. The method of claim 8 or claim 10 wherein the analysis step (ii) comprises: 

(a) selectively amplifying a specific fragment of nucleic acid (e.g. by PCR); 
and/or 

(b) testing for the presence of one or more restriction endonuclease sites 
within the breed determinant gene(s)AxMSHR gene (e.g. restriction fragment 
length polymorphism (RFLP) analysis); and/or 

(c) determining the nucleotide sequence of all or a portion of the breed 
* determinant gene(s)/aMSHR gene; and/or 

(d) probing the nucleic acid sample with an allele-specific DNA or RNA 
probe; and/or 

(e) carrying out one or more PCR amplification cycles of the nucleic acid 
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sample using at least one pair of suitable primers and then carrying out RFLP 
analysis on the amplified nucleic acid so obtained. 

12. A method of determining the coat colour genotype of a pig which comprises: 

(i) obtaining a sample of pig aMSHR protein; and 

(ii) analysing the protein obtained in step (i) to determine the amino acid 
sequence at those positions associated with coat colour genotype or the size of the 
protein 

13. The method of claim 9 or claim 12 wherein the analysis step (ii) comprises: 

(a) probing the protein sample with an antibody (e.g. a monoclonal antibody) 
specific for an allele-specific epitope; and/or 

(b) electrophoretic analysis; and/or 

(c) chromatographic analysis; and/or 

(d) amino-acid sequence analysis; and/or 

(e) proteolytic cleavage analysis; and/or 

(f) epitope mapping and or 

(g) translating a copy of the DNA or RNA of die gene produced by PCR or 
other means in an in-vitro trancriptionAranslation system 

14. The method of claim 7 wherein the analysis step (ii) comprises determining the 
nucleotide sequence of the KIT or aMSHR gene or die amino acid sequence of the 
KIT or aMSHR protein. 

15. The method of claim 7 or claim 14 wherein the analysis step (ii) comprises 
establishing the presence or absence of at least one nucleotide change in the KIT or 
aMSHR gene and/or their flanking regions. 

1 6. The method of claim 10 or claim 1 1 wherein the determination in step (ii) 
involves identifying the presence or absence of at least one missense mutation, 
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insertion or deletion in the the ctMSHR gene and/or it's associated flanking regions. 

17. The method of any one of claims 7, 10, 1 1, 14 and 15 wherein the analysis step 
(ii) further comprises determining the association between one or more microsatellite 
or other linked marker alleles linked to the KIT ox aMSHR gene and to particular 
alleles of the KIT ox aMSHR gene. 

1 8. The method of claim 17 wherein the analysis step (ii) is based on the 
identification of microsatellite markers present in the nucleic acid sample. 

1 9. The method of claim 7 wherein the analysis step (ii) comprises: 

(a) determining the association between one or more microsatellite or other 
linked marker alleles linked to the ATT or aMSHR gene and to particular 
alleles of the ATT or aMSHR gene; 

(b) determining which microsatellite or other linked marker allele or alleles 
are present in the nucleic acid sample. 

20. A method of determining the coat colour genotype of a pig which comprises: 

(i) determining the association between one or more microsatellite or other 
linked marker alleles linked to the aMSHR gene and particular alleles of the 
aMSHR gene; 

(ii) obtaining a sample of pig nucleic acid: and 

(iii) analysing the nucleic acid obtained in (ii) to determine which 
microsatellite or other linked marker allele or alleles are present. 

21. The method of any one of claims 7, 10, 1 1 and 14-20 wherein the analysis step 
(ii) further comprises the step of determining the genotype of at least one additional 
locus. 
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22. The method of claim 21 wherein the additional locus is an additional coat colour 
locus. 

23. The method of claim 22 wherein the additional coat colour locus is the KIT gene 
locus (e.g. the pig KIT gene locus). 

24. The method of claim 23 wherein the ATT gene locus is analysed to determine 
whether it carries any polymorphism associated with Belt genotype. 

25. The method of claim 24 wherein the determination comprises RFLP analysis. 
26 A method of determining the coat colour genotype of a pig which comprises: 

(i) obtaining a sample of pig nucleic acid: and 

(ii) analysing the nucleic acid obtained in (i) to determine whether the KIT 
gene carries any polymorphism associated with Belt genotype. 

27. A method as claimed in claim 26 wherein step (ii) comprises RFLP analysis. 

28. A method as claimed in claim 26 or claim 27 wherein a sample of pig genomic 
DNA is amplified using PCR and a pair of suitable primers. 

29. The method of claim 21 wherein the additional locus is a breed determinant 
gene locus selected from any of those genes specified in claim 6. 

30. The method of claim 21 wherein the additional locus is a breed specific marker. 

3 1 . The method of claim 30 wherein the breed specific marker is a microsatellite 
marker. 

32. The method of any one of claims 7, 10, 1 1, 14-23 and 28-3 1 wherein the analysis 
step (ii) comprises PCR using at least one pair of suitable primers. 
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33. The method of claim 32 wherein the gene is the pig clMSHR gene and at least one 
pair of suitable primers is: 

ctMSHR Forward Primer 1 : (5'-TGT AAA ACG ACG GCC AGT RGT GCC TGG 
AGG TGT -3'); 

ctMSHR Reverse Primer 5: (5'-CGC CCA GAT GGC CGC GAT GGA CCG-3'); or 

aMSHR Forward Primer 2: (5'-CGG CCA TCT GGG CGG GCA GCG TGC-3') 
aMSHR Reverse Primer 2: (5'-GGA AGG CGT AGA TGA GGG GGT CCA-3'); or 
aMSHR Forward Primer 3: (5'-GCA CAT CGC CCG GCT CCA CAA GAC-3') 
aMSHR Reverse Primer 3: (5'-GGG GCA GAG GAC GAC GAG GGA GAG-3'). 

34. The method of any one of claims 7, 10, 1 1, 14-23 and 28-33 wherein the analysis 
step (ii) comprises restriction fragment length polymorphism (RFLP) analysis, for 
example involving digesting the pig nucleic acid with one or more of the restriction 
enzymes ArfUI, Hhal and/or BspVU. 

35. The method of claim 34 wherein the gene is the pig aMSHR gene and the 
analysis involves identification of a polymorphism at nucleotide position 283, 305, 
363, 370, 491, 727, 729 1 162 or between nucleotide positions 60 and 70 or between 
nucleotide positions 1005 and lOlOof the sequence of the pig aMSHR gene. 

36. The method of claim 7 wherein the analysis step (ii) carrying out one or more 
PCR amplification cycles of the nucleic acid sample using at least one pair of suitable 
primers and then carrying out RFLP analysis on the amplified nucleic acid so 
obtained to determine the KIT or aMSHR genotype of the pig. 

37. The method of claim 36 wherein the gene is the pig aMSHR gene and the at least 
one pair of suitable primers is as defined in claim 35. 

38. The method of claim 30 or 31 wherein the gene is the pig ATT or aMSHR gene 
and the RFLP analysis is as defined in claim 28. 
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39. The method of any one of claims 1-9, 1 1, 13-19, 21 -25 and 29-38 wherein the 
animal product is meat (e.g. processed and/or canned meat), egg, egg swab or 
washing, semen, wool or learner. 

40. The method of any one of claims 1-9, 1 1, 13-19 and 21-39 wherein the sample 
comprises genomic DNA, RNA or mitochondrial DNA. 

41. The method of any one of claims 1-9, 1 1, 13-19 and 21-40 wherein the animal is 
a mammal (e.g. pig, cattle, dog, cat, horse, sheep, rodent or rabbit), fish (e.g. salmon 
or trout) or bird (chicken or turkey). 

42. A kit for: 

(a) differentiating animal products on the basis of breed origin; or 

(b) determining or testing the breed origin of an animal product; or 

(c) validating an animal product; 

comprising one or more reagents for analysing the alleles) of one or more breed 
dete rminant genes present in the sampl e 

43. A kit for determining the coat colour genotype of a pig, comprising one or 
more reagents for analysing the aMSHR genotype of the pig. 

44. A kit as claimed in claim 42 or claim 43 which is adapted to be used with a 
sample of pig genomic DNA. 

45. A kit as claimed in any on of claims 42 to 44 comprising one or more reagents 
for tarrying out at least one cycle of PCR together with at least one pair of suitable 
primers. 

46. A kit as claimed in claim 45 wherein the atleast one pair of suitable primers is: 

aMSHR Forward Primer 1 : (5"-TGT AAA ACG ACG GCC AGT RGT GCC TGG 
AGG TGT CCA T-30 



SUBSTITUTE SHEET (RULE 26) 



.WO 98/54360 



PCT/GB98/01531 



79 

aMSHR Reverse Primer 5: (5'-CGC CCA GAT GGC CGC GAT GGA CCG-3'); or 
aMSHR Forward Primer 2: (5'-CGG CCA TCT GGG CGG GCA GCG TGC-3') 
aMSHR Reverse Primer 2: (5'-GGA AGG CGT AGA TGA GGG GGT CCA-3*); or 
aMSHR Forward Primer 3: (5'-GCA CAT CGC CCG GCT CCA CAA GAC-3') 
aMSHR Reverse Primer 3: (5*-GGG GCA GAG GAC GAC GAG GGA GAG-3'). 

47. A kit as claimed in any one of claims 42 to 46 which comprises one or more 
reagents for RFLP analysis of pig nucleic acid. 
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FIG.1 

a: Nucleotide sequence 



1 27 
Wild Soar CTCCCTGCTCCCTGCTCCCTGGCGGGACG ATG CCT GTG CTT GGC CCG GAG AGG AGG 



Me is nan 
Pietrain 



75 

Wild Boar CTG CTG GCT TCC CTC AGC TCC GCG CCC CCA GCC GCC CCC ** CGG CCG CCA 

Meishan + * 

Pietrain 



CC 



126 

Wild Boar ACG CCT CGG GCT CAG ACC AAC CAG ACG GGC CCC CAG TGC CTG GAG GTG TCC 

Meishan _ 

Pietrain 

177 

Wildboar ATT CCC GAC GGG CTC TTC CTC AGC CTG GGG CTG GTG AGC CTC GTG GAG AAC 

Meishan 

Pietrain 

Largewhite _ ' '/ " 

Hampshire 

Duroc ... 



228 

Wildboar GTG CTG GTG GTG GCC GCC ATC GCC AAG AAC CGC AAC CTG CAC TCG CCC ATG 

Meishan 

Pietrain * * " " * 

Largewhite 

Hampshire 

Duroc t 

279 

Wildboar TAC TAC TTC GTC TGC TGC CTG GCC GTG TCG GAC CTG CTG GTG AGC GTG AGC 

Meishan 

Pietrain 

Largewhite 

Hampshire 

Duroc 



329 

Wildboar AAC GTG CTG GAG ACG GCC GTG CTG CTG CTG CTG GAG GCG GGC GCC CTG GCC 
Meishan ... A r 

Pietrain [ * ] * " * " * * * * * * * * * * ' * * * * * * * 

Largewhite [][ [/ 

Hampshire 

Duroc , * * ' * * ~ ~ * " * ' * * * * * * * " * * * * " * " * ' ' " " * * 



380 

Wildboar GCC CAG GCC GCC GTG GTG CAG CAG CTG GAC AAT GTC ATG GAC GTG CTC ATC 
Meishan 

Pietrain " ][[ '/[ [' ' ' ' ' " 

Largewhite 

Hampshire 

Duroc . . . 
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431 

Wiidboar TGC GGC TCC ATG GTG TCC AGC CTC TGC TTC CTG GGC GCC ATC GCC GTG GAC 

Meishan 

Pietrain 

Largewhite 

Hampshire 

Duroc 

483 

Wiidboar CGC TAC GTG TCC ATC TTC TAC GCG CTG CGC TAC CAC AGC ATC GTG ACG CTG 

Meishan 

Pietrain 

Largewhite 

Hampshire 

Duroc _ . . 



534 

Wiidboar CCC CGC GCG GGG CGG GCT ATC GCG GCG ATC TGG GCG GGC AGC GTG' CTC TCC 

Meishan 

Pietrain 

Largewhite 

Hampshire 

Duroc t . .. . . 



585 

Wiidboar AGC ACC CTC TTC ATC GCC TAC TAC CAC CAC ACG GCC GTC CTG CTG GGC CTC 

Meishan 

Pietrain 

Largewhite 

Hampshire 

Duroc 

636 

Wiidboar GTC AGC TTC TTC GTG GCC ATG CTG GCG CTC ATG GCG GTA CTG TAC GTC CAC 

Meishan 

Pietrain 

Largewhite 

Hampshire 

Duroc 



687 

Wiidboar ATG CTG GCC CGG GCC TGC CAG CAC GGC CGG CAC ATC GCC CGG CTC CAC AAG 
Meishan 

Pietrain [ [ * * [][ * * '[[ \ \ 

Largewhite 

Hampshire 

Duroc 



738 

Wiidboar ACG CAG CAC CCC ACC CGC CAG GGC TGC GGC CTC AAG GGC GCG GCC ACC CTC 

Meishan _ . .A - . . 

Pietrain _ _\ [[] " * * * 

Largewhite [ [ ] \ * * * * * ' " * * * * * * * * * * 

Hampshire ^ *^ \ ' ^ * * * * * * * * * [ ' * * * [ 

Duroc ...... n 
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Wildboar ACC ATC CTG CTG GGC GTC TTC CTC CTC TGC TGG CCA CCC TTC TTC CTG CAC 

Meishar. 

Pietrain 

Largewhite 

Hampshire 

Duroc 

840 

Wildboar CTC TCC CTC GTC GTC CTC TGC CCC CAG CAC CCC ACC TGC GGC TGC GTC TTC 
Me is nan 

Pietrain 

Largewhite 

Hampshire 

Duroc 

885 

Wildboar AAG AAC GTC AAC CTC TTT CTG GCC CTC GTC ATC TGC AAC TCC ATC 

Meishan 

Pietrain 

Largewhite 

Hampshire 

Duroc 



b. Amino acid sequence 

99 

Wildboar 7PNGLFLSLG LVSLVENVLV VAAIAKNRNL HSPMYYFVCC LAVSDLLVSV 5NVLETAVLL 

Meishan M. ..... P 

Largewhite 

Hampshire 

Duroc 



Wi ldboar 

Meishan 

Largewhite 

Hampshire 

Duroc 



159 

LLEAGALAAQ AAVVQQLDNV MDVLICGSMV SSLCFLGAIA VDRYVSI FYA LRYHSIVTLP 



. N . 



Wildboar 

Meishan 

Largewhite 

Hampshire 

Duroc 



219 

RAGRA1AAIW AGSVLSSTLF IAYYHHTAVL LGLVSFFVAM LALMAVLYVH MLARACQHGR 



279 

Wildboar HIARLHKTQH PTRQGCGLKG AATLTILLGV FLLCWAPFFL HLSLVVLCPQ HPTCGCVFKN 
Meishan 

Largewhite ...'.['.[['.[ //...[ [[ .......... 

Hampshire 

Duroc T 



Wildboar 

Meishan 

Largewhite 

Hampshire 

Duroc 



VNLFLALVIC NSI 
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Sequence Alignment Across the Exon /Intron Border of KIT Exon 17 

Allele Gene Copy Sequence 

Exon 17 Intron 17 

I KIT1 AAT TAC GTG GTC AAA GGA AAC j GTG AGT ACC CAC GCT CTC CTG ACA GTC 
KIT2 |A 

I P KIT1 |G 

kiti ;:; ";ig;:\;:\;;\;;\;;\;;\;;\;;\;;* 

i KITI . 



FIG. 5 
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FIG. 6C 
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Ratio of normal to splice mutant KIT 
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Ratios for Landrace and Large White 




♦ ratio 



Landrace Breed Large White 

FIG. 8 
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FIG. 9 
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1 

ATG AGA GGC GCT CGC CGC GCC TGG GAT TTT CTC TTC GTC CTG CAG CTC TTG 
52 

CTT CGC GTC CAG ACA GGC TCT TCT CAG CCA TCT GTG AGT CCA GAG GAA CTG 
103 

TCT CCA CCA TCC ATC CAT CCA GCA AAA TCA GAG TTA ATC GTC AGT GCT GGC 
154 

GAT GAG ATT AGG CTG TTC TGC ACC GAT CCA GGA TCT GTC AAA TGG ACT TTT 
205 

GAG ACC CTG GGT CAG CTG AGT GAG AAT ACA CAC GCA GAG TGG ATC GTG GAG 
256 

AAA GCA GAG GCC ATG AAT ACA GGC AAT TAT ACA TGC ACC AAT GAA GGC GGT 
307 

TTA AGC AGT TCC ATT TAT GTG TTT GTT AGA GAT CCT GAG AAG CTT TTC CTC 
358 

GTC GAC CCT CCC TTG TAT GGG AAG GAG GAC AAT GAC GCG CTG GTC CGA TGT 
409 

CCT CTG ACG GAC CCA GAG GTG ACC AAT TAC TCC CTC ACG GGC TGC GAG GGG 
460 

AAA CCC CTT CCC AAG GAT TTG ACC TTC GTC GCG GAC CCC AAG GCC GGC ATC ■ 
511 

ACC ATC AGA AAC GTG AAG CGC GAG TAT CAT CGG CTC TGT CTC CAC TGC TCC 
562 

GCC AAC CAG GGG GGC AAG TCC GTG CTG TCG AAG AAA TTC ACC CTG AAA GTG 
613 

AGG GCA GCC ATC AGA GCT GTA CCT GTT GTG GCT GTG TCC AAA GCA AGC TAC 
664 

CTT CTC AGG GAA GGG GAG GAA TTT GCC GTG ATG TGC TTG ATC AAA GAC GTG 
715 

TCT AGT TCC GTG GAC TCC ATG TGG ATC AGG GAG AAC AGC CAG ACT AAA GCA 
766 

CAG GTG AAG AGG AAT AGC TGG CAT CAG GGT GAC TTC AAT TTT CTG CGG CAG 
817 

GAA AGG CTG ACA ATC AGC TCA GCA AGA GTT AAT GAT TCT GGC GTG TTC ATG 
868 

TGT TAC GCC AAT AAT ACT TTT GGA TCT GCA AAT GTC ACA ACC ACC TTA GAA 
919 

GTA GTA GAT AAA GGA TTC ATT AAT ATC TTC CCT ATG ATG AAT ACC ACT GTG 
970 

TTT GTA AAC GAT GGA GAG GAT GTG GAT CTA ATT GTT GAG TAC GAG GCG TAC 
1021 

CCC AAA CCT GAA CAC CGA CAG TGG ATA TAT ATG AAC CGC ACT GCC ACT GAT 
1072 

AAG TGG GAG GAT TAT CCC AAG TCT GAG AAT GAA AGT AAC ATC AGA TAT GTA 
1123 

AGT GAA CTT CAC TTG ACC AGA TTA AAA GGG ACC GAA GGA GGC ACT TAC ACA 
1174 

TTT CTC GTG TCC AAT GCT GAT GTC AAT TCT TCT GTG ACA TTT AAT GTT TAC 
1225 

GTG AAC ACA AAA CCA GAA ATC CTG ACT CAT GAC AGG CTC ATG AAC GGC ATG 
1276 

CTC CAG TGT GTG GCG GCA GGC TTC CCA GAG CCC ACC ATC GAT TGG TAT TTC 
1327 

TGT CCA GGC ACC GAG CAG AGA TGT TCC GTT CCC GTT GGG CCA GTG GAC GTG 
1378 

CAG ATC CAA AAC TCA TCT GTA TCA CCG TTT GGA AAA CTA GTG ATT CAC AGC 
1429 
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TCC ATT GAT TAC AGT GCA TTC AAA CAC AAC GGC ACG GTG GAG TGC AGG GCT 
1480 

TAC AAC GAT GTG GGC AAG AGT TCT GCC TTT TTT AAC TTT GCA TTT AAA GAA 
1531 

CAA ATC CAT GCC CAC ACC CTC TTC ACG CCT TTG CTG ATT GGT TTT GTG ATC 
1582 

GCA GCG GGT ATG ATG TGT ATC ATC GTG ATG ATT CTC ACC TAT AAA TAT CTA 
1633 

CAG AAG CCC ATG TAT GAA GTA CAG TGG AAG GTT GTC GAG GAG ATA AAT GGA 
1684 

AAC AAT TAT GTC TAC ATA GAC CCA ACG CAA CTT CCT TAT GAT CAC AAA TGG 
1735 

GAA TTT CCC AGG AAC AGG CTG AGT TTT GGC AAA ACC TTG GGT GCT GGC GCC 
1786 

TTC GGG AAA GTC GTT GAG GCC ACT GCA TAC GGC TTA ATT AAG TCA GAT GCG 
1837 

GCC ATG ACC GTT GCC GTG AAG ATG CTC AAA CCA AGT GCC CAT TTA ACG GAA 
1888 

CGA GAA GCC CTA ATG TCT GAA CTC AAA GTC TTA AGT TAC CTC GGT AAT CAC 
1939 

ATG AAT ATT GTG AAT CTT CTC GGC GCC TGC ACC ATT GGA GGG CCC ACC CTG 
1990 

GTC ATT ACA GAA TAT TGT TGC TAT GGT GAT CTC CTG AAT TTT TTG AGA CGG 
2041 

AAA CGT GAT TCG TTT ATT TGC TCA AAG CAG GAA GAT CAC GCA GAA GCG GCG 
2092 

CTT TAT AAG AAC CTT CTG CAT TCA AAG GAG TCT TCC TGC AGT GAC AGT ACT 
2143 

AAC GAG TAC ATG GAC ATG AAA CCC GGA GTG TCT TAT GTG GTA CCA ACC AAG 
2194 

GCA GAC AAA AGG * AGA TCT GCG AGA ATA GGC TCA TAC ATA GAA CGA GAT GTG 
2245 

ACT CCT GCC ATC ATG GAA GAT GAT GAG TTG GCC CTA GAC CTG GAG GAC TTG 
2296 

CTC AGC TTT TCT TAC CAA GTG GCA AAG GGC ATG GCC TTC CTC GCC TCG AAG 
2347 

AAT TGT ATT CAC AGA GAC TTG GCG GCC AGA AAT ATC CTC CTT ACT CAT GGT 
2398 

CGA ATC ACA AAG ATT TGT GAT TTT GGT CTA GCC AGA GAC ATC AAG AAT GAT 
2449 

TCT AAT TAC GTG GTC AAA GGA AAC GCT CGG CTA CCC GTG AAG TGG ATG GCA 
2500 

CCT GAG AGC ATT TTC AAC TGT GTC TAC ACA TTT GAA AGC GAT GTC TGG TCC 
2551 

TAT GGG ATT TTT CTG TGG GAG CTC TTC TCT TTA GGG AGC AGC CCC TAC CCC 
2602 

GGA ATG CCA GTT GAT TCT AAA TTC TAC AAG ATG ATC AAG GAG GGT TTC CGA 
2653 

ATG CTC AGC CCT GAG CAT GCA CCT GCG GAA ATG TAT GAC ATC ATG AAG ACT 
2704 

TGC TGG GAT GCG GAT CCC CTC AAA AGA CCA ACG TTT AAG CAG ATC GTG CAG 
2755 

CTG ATT GAG AAG CAG ATT TCG GAG AGC ACC AAT CAC ATT TAT TCC AAC TTA 
2806 

GCG AAC TGC AGC CCC CAC CGG GAG AAC CCC GCG GTG GAT CAT TCT GTG CGG 
2857 

ATC AAC TCC GTG GGC AGC AGT GCC TCC TCC ACG CAG CCT CTG CTT GTC CAC 
2908 

GAA GAT GTC TGA 
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1012 

Wild Boar CTGCAGTGCT CCTGGTGAGG GGGGACGGGC 

Meishan : 

Large Black : 

Hampshire 

Pietrain 

Duroc 

1062 

Wild Boar GCTGGAGCCA GGCTGCGGGG CTGAGGGCAG TGGTGCCGTC CTGCGGCCCG 

Meishan 

Large Black 

Hampshire 

Pietrain 

Duroc 

1112 

Wild Boar GTTCCTACGT GGCTGGGCAG CCCCTTGGCA GAGAGGACGG GCCGGACATC 

Meishan 

Large Black 

Hampshire 

Pietrain 

Duroc 



1162 

Wild Boar TCTGAAGGTA TGGACGCTGG ACCCTCTGGG GCCCGACAGA GGAAGAGCCA 

Meishan G 

Large Black G 

Hampshire G 

Pietrain G 

Duroc G 

1212 

Wild Boar GCACTTCCAG GAGGCATGGG GAGTGGGGGA GGCTGGAGAG ACGGCGGGGA 

Meishan 

Large Black 

Hampshire 

Pietrain 

Duroc 

1262 

Wild Boar GCGCCACCTC CATCCAGAGA CCACCACGCC CGCCTTTGGG -GCGCGCTCTG 
Meishan 



Large Black 

Hampshire 

Pietrain 

Duroc 

FIG. 12 
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1312 



Wild Boar GGGACTTTGC CCCCCACTGG GGTGGGACGT G T GCGGGC AG AAGCTGTCCG 

Meishan 

Large Black 

Hampshire 

Pietrain 

Duroc 



Wild Boar GGTGTTGCTC ACTGCAGGAC CTCAGGGGAA GGCCTTCGTG ACTGCTAGGA 

Meishan 

Large Black 

Hampshire 

Pietrain 

Duroc 



Wild Boar AGCAGGCGCA GCGCCCCGGC GGAGGGCGGG GCCCCTCTCT TCTACGGCTC 

Meishan 

Large Black 

Hampshire 

Pietrain 

Duroc 

Wild Boar AGTG 

Meishan 

Large Black 

Hampshire 

Duroc 



1362 



1412 
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